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Introduction; International Concerns 
and Controversies 



P, Piimfrey and P. Owen 



Summary 

The perceiwd importance of literacy in general and reading in 
particular is coiiinion in societies across the world. Despite / 
contextual differences, there are numerous common concerns 
and controversies. From these, three areas are identified. The 
first is developing an improved understanding of the nature of 
children's early reading development. The second is the consid- 
eration of ways in which children's reading can be encouraged, 
(These two areas are addressed in Volume 1,) In this volume 
issues of curriculum and assessment in the context of profes- 
sional accountability are addressed. 



Mow much is the ability to read worth? To be illiterate in most contem- 
poraiy societies is to he marginalized and disadvantaged. Anyone who 
arrives in a countr>' where lack of knowledge of the language prevents 
their reading the most basic written signs, will appreciate the meta- 
phorical imprisonments attributable to their ignorance of the language. 
Literacy is both a contributor to, and an amplifier of. human abilities. 
To argue that, in the era of information technology, learning to read is 
preparing children for the nineteenth, rather than the twenty-first cen- 
tury, is to misunderstand the nature of human thought and its develop- 
ment. Literacy liberates. 

In countries across the world, standards of literacy and the processes 
imderpinning them are of central interest to politicians, parents, profes- 
sionals and pupils. The editors and contributors to this series are well 
aware of the controversial, complex interrelated and changing nature of 
views held concerning the receptive and expressive aspects of language 
involved in the emergence, development and assessment of literacy, 
(Xir focus on reading does not deny the importance of other modes of 
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language. Reading is but one facet of literacy; literacy merely one 
component of communication and communication one part of cliild 
development. 

Optimizing pupils' reading attainments depends cmcially on pro- 
fessionals' understanding of child development and the conditions that 
facilitate reading as a thinking process. In this there are reciprocal rela- 
tionships between advances in theory, research and practice. Mutual 
benefits are likely to accrue when professionals from different coimtries 
and disciplines are able to identify' literacy-related issues of common 
concern and share experiences of promising developments. For all 
teachers, knowing why we use particular methods and materials and 
their effectiveness in specified circumstances, integrates theory\ its 
applications and their evalution. 

Moves towards an interactive model integrating the unjustifiably 
polarized 'top-down' versus 'bottom-up' positions concerning the nature 
of emergent reading, its development, teaching and assessment are 
taking place. In respect of the first two of these. Chapter 1 in Volume 
1 provides a constmctive synthesis. This does not mean that important 
controversies do not continue. In the advancement of knowledge in all 
fields, the dialectic involving hypothesis, antithesis and synthesis is ever 
alive. The liveliness of the continuing debate on, for example, emergent 
and developing reading is immediately apparent when one compares 
the stances represented in Chapters 6 and 7 in Volume 1. The same is 
true in relation to assessment issues addressed in Chapters 7 to 12 in 
Volume 2. 

In considering the merits of the cases presented in each of the 
chapters in both volumes, it is important to identify the author's implicit 
or explicit assumptions concerning the nature of reading, its develop- 
ment, teaching and UvSsessment (for example. Volume 2, Chapter 2). 
The theoretical coherence of a contributors case can also be consid- 
ered in relation to the quality and extent of evidence adduced in support. 
Further, in the interests of reaching a balanced judgment, a considera- 
tion of the contributions made by different authors based on alternative 
theoretical stances and from different professional specialisms cannot 
be ignored. 

Some of the chapters are based on researches carried out over 
many years and with considerable numbers of subjects: others report 
recent findings from smaller-scale studies. Some chapters are descrip- 
tive of what is deemed promising practice. 

With confidence, we assert that no individual has a freehold on 
validity. The words of Bacon continue to give important messages. 
'Read not to contradict and confute, nor to believe and take for granted. 
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nor U) find talk and discourse, but to weigh and consider' (Bacon 1561- 
1626: Essay 50 'Of Studies ). 

There are three major common concerns of those professionals 
involved in how children become literate and by what means such 
achievements can be appraised. These are: 

• developing understanding of the nature of children's emergent 
reading; 

• considering ways in which children's emergent and subsequent 
reading can be developed; and 

• the assessment and improvement of reading processes and 
standards. 

From these extensive fields, evidence drawn from the work of 
colleagues in many countries suggests that, within these concerns, shared 
imdcrstandings are gradually emerging from research and practice. 
Inevitably, there are also ongoing theoretical controversies that have 
important implications for practice in the classroom. The top-down' 
versus 'bottom-up* theories of reading development exemplify one 
controversy that appears to be nearing a resolution in an interactive 
model. These (inevitably partial) understandings cannot be ignored if 
v^'e are to increase our ability to conceptualize, control and optimize 
the devek)pmenl of children's standards of literacy in general and read- 
ing in particular. 

There is no claim that collectively the specific topics addressed by 
contributors represent a comprehensive co\'erage of critical issues. 
H(nvever. to remain unaware of the work being done in countries other 
than one's own would be irresponsibly insular. There is a growing 
consensus that, irrespective of the countr\', culture or language, the 
topics identified above merit inclusion in both the initial training of 
teachers and in continuing professional development. 

This series is distinctive on the combined basis of four major counts. 
Firstly, it is internationally oriented. It provides a somewhat overlooked 
international perspectives on the three issues identified above. Evidence 
drawn from the following countries is presented: Australia; Canada: 
Denmark; England: France; (lermany; Greece; Guam; Hungaiy; Israel; 
Italy; Jamaica; Japan; New Zealand; Northern Ireland; Scotland; Spain; 
and the USA. Reports on reading attainments across the thirty-two school 
systems and twenty-one language groups included in the International 
Association for the Evaluation of Educational Achievement (lEA) reading- 
literacy study are also reported and discussed. 

Secondly, it is inlerdisciplinarv'. Professionals from complementary' 
fields describe promising developments from their respective viev^ poinls; 
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teachers; leacher-trainers; psychologists: advisers; inspectors; adminis- 
trators; statisticians; and research workers. 

Thirdly, il contains messages for teachers and mentors concerning 
their regular work with pupils on encouraging literacy. In relation to 
improving and assessing reading, it combines what can profitably be 
done with why this is the case. 

Fourthly, it addresses international issues of accountabiliry\ Research 
studies and promising classroom practice from around the world are 
reported highlighting implications for the design, implementation, im- 
provement and evaluation of reading programs. 

With the increasingly multicultural character of societies across the 
world, the two volumes are planned to appeal to an international 
readership, although predominantly in English-speaking countries. It is 
expected that the contents of both books will be of interest, albeit 
differentially, to the following groups. 

• teachers in mainstream primary' schools; 

• teachers in special schools and units; 

• LEA ad\'is(>ry and support staff; 

• educational and child psychologists; 

• students on initial teacher-training courses; 

• teacher-trainers and school-based mentors; and 

• research workers. 

The strength of the two volumes is that they bring together, under 
the three international concerns identified, the work of professionals in 
different coimtries. Volumes 1 and 2 provide complementary informa- 
tion from colleagues with similar professional concerns working in 
different cultural contexts. The aim is to build bridges between theory, 
research and practice. 

The genesis of this two-volume series derives from the editors* 
longstanding in\'ol\'ements in seeking to understand more fully, and 
therel')y improve, the learning and teaching of literacy in general and 
reading attainments and progress in particular, of pupils in schools. Our 
work as teachers in mainstream secondar\' and primaiy schools, special 
.schools and units, language and reading specialists in support ser\iccs. 
research workers, academics and authors provides the basis for our 
involvements. Our contributions to initial training courses taken by 
teachers, coupled with the pro\'isioti of courses of advanced train- 
ing for cjualified and experienced teachers, underline our personal 
commitments. 

Over inany years, our activities have led to the establishment of 
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extensive professional contacts with colleagues in many countries en- 
gaged in similar work. Reflecting on our wide network of contacts and 
on common professional concerns and controversies, led to the present 
two volumes. 

In the twenty-four chapters comprising this series, we have pre- 
sented a selection of articles that provide information on research and 
practice. Each book is in two parts. Part 1 in Volume 1 concerns the 
importance of phonological awareness. Pan 2 addresses wider concerns 
related to the dewlopment of children's reading. In Volume 2, Part 1 
focuses on curriculum concerns and Part 2 deals with aspects of ac- 
countability and assessment. These contributions bear on some of the 
most important current concerns and contro\'ersies from the broad fields 
constituting emergent and developing reading and curriculum and 
assessment issues. Each of the chapters has a common structure. All 
authors have identified from their contributions a number of important 
'messages for teachers*. 

In Volume 2, the six chapters comprising Part 1 continue with a 
range of curriculum concerns. These include considerations of what is 
meant by literacy in different cultural contexts and demonsSirate changes 
in current thinking about the nature of the concept and the materials 
and methods likely to enhance the acquisition of literacy. Tliis affects 
how children's learning can be enhanced in culturally diverse com- 
munities. The two final chapters in this section return to a perennial 
contro\'ersy concerning ways in which the writing systems in different 
countries may affect standards of literacy. The six chapters in Part 2 are 
devoted to assessment issues. The pioneer work being carried out at 
the Tniversity of Manchester School of I'ducation, that in progress at 
the Royal Melbourne Institute of Technology, and the work of the 
Ministry- of Education and Culture in Israel, are arousing increasing 
international interest. The section concludes with two chapters based 
on the work of the International Association for the Evaluation of 
Educational Achievement reading-literacy study. 

In conclusion, we know, but find unconvincing, the reasons for 
non-participation by the l^K in ihis major international research. Did 
ro///* country take pan in the lEA reading-literacy study? If not, do you 
know why not? How true is it that ignorance can provide a (misguided 
and short-term) bliss? Perhaps fear of international comparisons of the 
educational efficacy of educational systems in promoting literacy will 
remain a central international concern for a long time. 
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Expanding the Dimensions of 
World Literacy 



C Foley 



Summary 

The ultiniale goal of reading instruction should be to produce 
young people who can read fluently and who choose to do so 
for both pleasure and information. It is argued in this chapter 
that expanding the dimensions of world literacy to make it work 
better for more learners will best be facilitated by singling out 
and addressing facets of reading instruction which make for 
enjoyable literacy activities. The importance of making reading 
instruction enjoyable must remain at the centre of the world- 
literacy movement as its dimensions expand in an effort to 
understand better not only how children learn to read but why 
children choose to read. 



Introduction 

Fxamination of educational programmes and practices should provoke 
professionals to ask, AVhy would children enjoy reading instruction?' 
The ultimate goal of reading instruction should be to graduate young 
people who can read and write fluently and who choose to do so in 
order to improve their lives personally, as well as politically (e.g., im- 
prove their status and influence). Yet, the current worldwide pattern of 
teaching and assessing and then, based on some standardized criterion, 
determining whether literacy has been achieved, often ignores the 
inipoitance of children's interest in, and enthusiasm for, literary activities. 
Past and present efforts by nations to achieve minimal levels of 
literacy for the majority of their populations have focused on the peda- 
gogy of numerous years of structured, redundant reading and writing 
instructional methodologies often delivered via boring and inappropriate 
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reading texts. The results have been unsatisfacto^>^ Students who do 
eventually 'measure up' on a skills test established for accountability 
purposes can still fail the true test of literacy by avoiding reading, 
writing and learning activities following their years of formal schooling. 
For students truly to enjoy language endeavours, enjoyable experiences 
and growth must occur early in life, during the very process of learning 
to read. Then, as adults, individuals will be more likely to participate 
in literary activities daily, despite busy home and work schedules. 

Educators, through dialogue, continuously examine and expand 
the dimensions of world literacy. Past focuses have included finding 
ideal instructional approaches and developing effective teacher-training 
programmes. Establishing national standards, determining key skills 
to be measured and designing tests to assess proficiency levels are 
additional areas that have been given considerable attention. While the 
manner in which comparisons will be made amongst students in New 
York and Chicago, New South Wales and Queensland, England and 
Germany, forms an important foundation in the examination of, and 
quest for, world literacy, it is only one facet of the international move- 
ment. Great progress has been made in the study of how children learn 
to read; yet, new questions and focuses have also emerged, causing the 
dimensions of world-literacy dialogue and research to expand. Three 
examples of interesting issues related to reading materials and instmction 
that prompt further exploration are the use of: multicultural literature 
which mirrors the faces in the classroom; reading materials which 
equitably and realistically portray males and females; and reading in- 
stRictional approaches which interest and motivate young learners. 

In expanding the first dimension of world literacy, which pertains 
to tilt/ use of multicultural literature in the classroom, the ethnic rep- 
resentation in children's reading materials must consider the ethnic 
backgrounds of students in the classroom. All students need opportun- 
ities to read a vast array of texts to which they can relate culturally in 
order to dev elop positive self-perceptions and strong personal identi- 
ties. Required reading materials should include selections with characters 
from various ethnic groups who are depicted in realistic settings and 
situations if children's understanding of the many cultures that surround 
them is to be strengthened. When reading materials used in the classroom 
Favour one ethnic group over the numerous other student populations 
present, students from the minority, non or underrepresented cultures 
may be unable or unwilling to relate to the characters in the texts. 
Limited ability or desire to relate can ultimately result in k)wered interest 
and ccMiiprehension. Negative, subliminal messages concerning societal 
values are sent to all readers, as well. 
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Have instructional reading materials changed over the years to 
better mirror the faces in today's classrooms? Basal-reading programs 
that have dominated reading instruction in American schools for over 
seventy years have influenced the cognitive and affective development 
of youngsters, causing them to reflect at times upon their very self- 
worth (Heymsfeld, 1989). vSmiih (1986) recognized that pupils who do 
not progiess satisfactorily in these popularly used reading programs 
probably experience difficulty due to a mismatch between their experi- 
ential backgrounds and the prior knowledge required to comprehend 
the texts. In other words, students who were at one lime categorized 
as poor achievers are now described as limited in their experiential 
background knowledge. Research demonstrates that students rely on 
their cultural backgrounds as well as on the cultural points of view 
presented in passages to comprehend a text (Priichard, 1990). Shannon 
n989b) suggests that basal-reading programs and standardized reading 
measures have fed a dual American myth, that minority and low socio- 
economic groups of students are responsible for the difficulties that 
they experience in learning to read while children from middle and 
upper-class homes are successful in learning to read because they are 
able to complete the basal program and pass the standardized achieve- 
ment measures administered. 

In American schools, shifts in demographics over time indicate that 
significant changes in the ethnic composition of the classroom have 
occurred. The prediction is that soon one of every three American 
children will come from an other-culture or non-English speaking home. 
Furthermore, the minority school-age population will increase from 2S 
per cent noted in the i980s to 42 per cent during the 1990s (Commission 
on Minority Participation in Education and American Life. 1988). In 
large urban areas. 80 to 90 per cent of the American school-age popu- 
lation will be non-white (Astin. 1982; Gonzalez, 1990). 

A second dimension related to the use of basal-reading materials 
that needs further attention is the large discrepancy in the portrayal 
ratio of females to males, a discrepancy which has existed for ox er four 
decades. In a 1977 .study examining the sex of central or main characters 
in 19(>}-7() basal readers (Racism and Sexism Resource Center for Edu- 
cators. 1977). 70 per cent of the basal characters were males. In another 
1970s study examining the freciuency of females and males appearing 
in 134 textbooks in stories with mastery themes (stories in which the 
central characters displayed qualities suih as ingenuity, creativity, 
braver\\ perseverance, achievement, adveniurousness. curiosity, auto- 
nomy and self-respect) the ratio of text characters was similar: one 
female to three males (Women on Words and Images, 197S). However. 
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stories with themes of passivit>' and dependence featured females six 
times as often as males, even when the females depicted could have 
been portrayed as appropriately handling the stor>' situations presented. 
Females were featured primarily as mothers or job holders, but rarely 
as both in the instructional materials being used in the classroom. 

Students at all grade levels need to be immersed in texts in which 
females and males are equivalently represented and realistically por- 
trayed. Reading materials must depict the ever-changing, expanding 
and divergent '•')les of males and females, as well, in order to improve 
human relationships. The sexist role conditioning of earlier days must 
not be encouraged or reinforced in the 1990s through reading textbook 
representations of females and males or the ultimate result will be 
further divisiveness. Females deser\'e to be portrayed in instructional 
texts as leaders, decision makers and problem soK'ers. Girls should 
appear as often as boys in main-cliaracter roles and in a variety of story- 
structures. 

Students need to be motivated to read. Academic achie\'einent is 
influenced by interest. Most school subjects are taught \'ia textbooks. 
Thus, the appeal of text selections is crucial. Students are able to under- 
stand and retain subject matter when they identify with the characters 
and the events. Most importantly, students are more interested and 
involved in reading selections where characters appear to be real to 
them. In a study conducted by Mc:Arthur and Fisen (1976), students 
were found to stay on task longer and remember stories of achievement 
when the characters were of the same sex as the reader. In addition, 
the students were most likely to accept a female character in a stoiy to 
be achievement-oriented if they had previously read a selection in which 
a female achieved. The more exposure the students had to non-sexist 
materials, tlie more they were influenced by, and able to retain, the 
values and attitudes of the materials. Despite such findings, society and 
textbook publishers continue to practise the \\ide use of male role 
♦models in books, television and other media. 

A third dimension of literacy which needs to expand further is that 
of classroom reading instruction. A wide repertoire of instructional 
strategies needs to be utilized by teachers to stimulate students to learn 
and to involve them more actively in the reading process. If years of 
instruction are indeed necessary to reach maturity in reading, as has 
been suggested by the 'basal kindergarten through eighth grade' in- 
structional approach, then efforts to maintain student interest over time 
and to adjust instruction for those who do not experience success, must 
be made. American basal programs have relied totally upon simplistic, 
instructional routines in the past, rather than capitalizing on novelty 
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during daily lessons. Two facts which are not always considered are 
that learning occurs when information is presented via new or unusual 
techniques and some students are more difficult to motivate than others, 
Reading instaiction cannot continue lo require students to partake in 
redundant exercises over lengthy periods of time, as has been the case 
in the USA with the daily, nine-year regimented use of the directed- 
reading lesson (Betts, 1946). 

Guam, a USA territory in the Pacific, struggles with the reality of an 
island-wide, mandated basal-reading instructional approach which is 
not meeting the needs of their unique American school population — 
a population which is not predominantly Caucasian middle-class, not 
predominantly male and not always challenged by the traditional basal- 
reading le.s.son. To better understand the research presented in this 
chapter, a brief histor\' of Guam's educational system follows. 

Guam's Educational System 

Guam became a possession of the United States in 1898. Previously, the 
Spanish provided education to the children of Guam in their native 
language of Chamarro. Under USA control, the public educational sys- 
tem w^as Americanized and all classroom instruction was conducted in 
Engli.sh. Instruction changed a third time with the Japanese occupation 
of Guam during World War II: during this time span, lessons were 
conducted in Japanese. The American system of education was rein- 
stated upon Guam's liberation from Japan in 1944. The current political 
leaning, toward becoming a Commonwealth, could potentially result in 
yet another instructional swing for this small island population. 

Presently, as a territory- of the USA, Guam follows federal regula- 
tions regarding education. Classroom practices reveal a purely Ameri- 
can instructional methodology. In reading, the basal approach, which 
lias long dominated reading instruction in mainland .schools (Flood and 
Lapp, 1986), is also the adopted, island-wide instructional program in 
Ckiam. Similarly, since the American public demands the administration 
of national, standardized reading assessments in the belief that the 
success of a school is retlvcted by the placement ot its students on such 
measures, standardized achievement tests are also administered to 
Ckiamanian students (Shannon, 1989a). However, the large number of 
students at risk' in the middle and high schools and the consistently 
low scores of local students of all ages on mainland-normed measures 
have caused speculation that CJuams educational sy.stem is sofnehow 
sub.standard' compared with that of the continental USA. 



C. Foley 



Guam's deflated standardized test scores may be attributed, at least 
in part, to the use of the American system of basal-reading instruction 
and standardized assessment, both of which are insensitive to the ex- 
periential backgrounds of Pacific islanc-^rs and Asians, small minority 
groups within America's contiguous borders but majority populations 
in Guam. Instructional practices advocated by the basal publishers might 
ignore the learning styles of the local students or their needs to become 
mature readers. Educators, such as Spencer (1990), have criticized the 
rigorous marketing efforts of publishers in the Micronesian region to 
sell instructional programmes to Guam and surrounding island school 
districts which are obviously 'inappropriate' because they are not geared 
to these unique populations of students. 

To grasp the multicultural mix in Guam and the male-female ratio, 
demographics are presented for the 1990-1 middle-school population. 
The demographic breakdown of this student population of 6,09S re- 
veals that the indigenous Chamarro student population was the largest, 
with 3,145 Chamarros from Guam and 129 from nearby islands (twenty- 
nine from Rota, twenty-one from Tinian and seventy-nine from Saipan). 
The second predominant ethnic group in the sixth to eighth grades 
were Filipinos, with 1.648 students. The remaining ethnic groups iden- 
tified included: 580 Caucasians, ninety-.seven African-Americans, forty- 
five Koreans, thirty-five Chinese, twenty-one Japanese, seventeen 
Hispanics and fifteen Native American Indians. In addition. 19i stu- 
dents came from the Federated States of Micronesia. 100 from Belau 
and thirty-one fr(;m other surrounding islands. Of the remaining fifty- 
nine middle-school students, forty-six were from 'other' Asian countries 
and thirteen from other* regions of the world. Guam's middle-school 
population is almost evenly divided with 3.517 males and 3,478 females 
(Department of Education. 1991). 

A reasonable expectation would be for the characters in the instruc- 
tional reading texts used in Guam to be similarly distributed by ethnic- 
ity and sex to the population serv iced for the best possible educational 
scenario to occur. In addition to noticeable problems with the character 
representation in the reading materials being used, classroom teachers 
in Guam also note reading instructional practices that are ineffective 
with their population. To address the three dimensions of world literacy 
targeted in this chapter, ethnic representation in reading materials, male 
and female representation in reading materials and reading instructional 
practices, a selection of recent studies are presented. All stem from the 
efforts of concerned and dedicated educators in the CUiam school sys- 
tem who are attempting to itiiprove reading comprehension and spark 
student interest during reading and c ontent-area instruction. 
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Method 

The first study discussed in this chapter focuses on Guam's unique 
multicultural middle-school population and the materials adopted for 
island-wide usage (Campbell, 1993). Campbell examined three seventh- 
grade basal-reading programs piloted in the middle schools to deter- 
mine how sensitive the programs were to the classrooms' ethnic mix 
and distribution of males and females. The seventh-grade basal texts 
were: Houghton Mifflin's Bright Glory /mm The Literature Experience 
P;'o^^mw(Pikulski et aL 1991), McDougal, Littell's Red Level from Vistas 
in Reading Literature CChaparro and Trost, 1989) and Silver Burdett 
and Ginn's Star Walk from World of Reading (Pearson et aL, 1989). 
The basal characters from all the stories in the three readers, with the 
exception of animal stories, poems and skills articles, were examined 
to determine their ethnicity and sex, the type of selection in which they 
appeared and tlie role that they played. 

Numerous studies have been conducted to examine instructional 
issues. Other studies shared in this chapter focused on the use of reading 
instructional variations which might stimulate student interest and 
increase coiiiprehension. The first study conducted by a fifth-grade 
instructor in a northern Guam elementar>' school was prompted when 
the teacher noted three basal reading activities which were largely 
ineffective with the Chaniarro and Filipino students in her classroom 
(Foley, Farra and Chang, 1991). These activities included the initial 
silent-reading seatwork task, the completion of a large number of seat- 
work pages and the written completion of the 'Focus on Comprehen- 
sk)n' (juestions found at the end of each selection. The instnictor decided 
to streamline the weekly basal lesson and replace these activities with 
taped repeated reading (Chomsky, 197(^) and paired repeated reading 
(Koskinen and Blum, 1986) for one-half of the class who became the 
experimental group. The control group continued to receive the regular 
basal program, as they had throughout their elementary school careers. 

In a similar study, middle-school science teachers also experimented 
with the taped-reading strategy in an effort to coax and to assist their 
reluctant and remedial readers to complete content-area homework 
reading assignments. To encourage their students to read the required 
text, these teachers made uidio tapes of the assigned textbook pas- 
sages and tlien made them available for clieckout by the students. As 
a result, taped reading sessions were possible in each student's home. 
The (juestion posed by these content-area teachers was a simple one: 
Would the students complete the reading homework assignments if 
provided with an instructional support system? 
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At the high-school level, McDonald (1993) experimented with a 
similar instructional variation, 'taped read-along\ to teach literature. In 
addition to the taped read-along variation, McDonald also implemented 
*story impressions' (McGinley and Denner, 1987), an instructional 
prereading strategy which utilizes prediction and links the reading and 
writing processes, to get her secondary students more actively involved 
in the reading process. Once again, the research of a local classroom 
teacher examined alternative instructional strategies that might appeal 
to her unique mix of students. Like other Guam educators who were 
choosing to vary the instructional classroom approach to reading tasks, 
McDonald recognized a need to increase student comprehension of 
required textbook readings. In her study, three groups of students, 
deemed to be equivalent, based on findings from the Kruskal-Wallis 
ter,t used to analyse the comprehension pretest scores of the groups, 
were assigned to one of three reading instructional approaches for a 
five-week period during which five literature selections were read. The 
control group received the traditional high-school instructional approach 
as detailed in the teacher's manual; the students read the selection 
silently and then answered the textbook comprehension questions. 
Students in treatment group two (taped read-along) listened to a taped 
recording of the literature selection and read along in their books. They 
then answered the comprehension (juestions. Students in treatment 
group three (story impressions in a taped read-along format) discussed 
what the story might be about using the given story clues, wrote a story 
guess, listened to the taped recording of the literature selection while 
reading along in their books, compared/contrasted the story guess with 
the actual story and then answered the comprehension questions. Each 
of the three groups rated how much they liked or disliked the strategy 
used weekly at the completion of the instmctional activities. 

Not all the variations to traditional and often repetitive classroom- 
reading instRiction implemented in Guam's classrooms by educators 
who felt that learning could be more enjoyable and more effective 
employed the taped reading element. A middle-school teacher 
(Bismonte, Foley and Petty, in press) substituted a single instructional 
component of the basal directed-reading lesson, the vocabulary activities, 
with 'possible sentences' (Moore and Moore, 1986). This is a strategy in 
which students make predictions about the relationships between un- 
known words and then use the text to evaluate and refine their vocabu- 
lary knowledge. One group of students, the control group, continued 
to receive the traditional basal instruction, while the others. Who became 
the experimental group, received the modified basal approach. Only 
differences in vocabulary were examined in this research design. 
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To improve students' suminary writing and comprehension of sci- 
ence text, a fifth-grade teacher in a southern Guam school (O'Mallan, 
Foley and Lewis, 1993) compared the use of the guided reading pro- 
cedure (Manzo, 1975) with the traditional content-area instructional 
approach. Pupils in the experimental group were taught to recall orally 
a science text read silently and to then confirm, organize and note 
relationships amongst the recalled facts prior to writing a summary of 
the text. At the same elementary school, a second instructor focused on 
social-studies instruction (Benito, Foley, Lewis and Prescott, 1993) 
providing metacognitive instruaion and practice in question-answer 
relationships (Raphael, 1982) to third, fourth and fifth graders to improve 
textbook comprehension. This instructional variation was a drastic 
deviation from the traditional basal-reading instruction received by 
the control group. In all the aforementioned research studies which 
examined traditional classroom instruction in search of more promising 
approaches to reading, the findings were promising. 

Results 

Results of the Campbell study related to the first diinension, ethnicity, 
revealed tliat characters in today's middle-school basal readers greatly 
resemble those found in basals a decade ago (Logan and Garcia, 1982), 
Column percentage scores (Table 1.1) permit the comparison of ethnic 
groups within a particular basal programme while row percentage scores 
permit ethnic comparisons across the three basal programmes included 
in this study, 

Tcnal row percentage scores in the Campbell study indicate that 
70.59 per cent of the 415 basal characters examined were Caucasians 
(living either within or outside the continental United States). The second 
ethnic group most often depicted in the current seventh-grade basals 
was the Hispanic (12.77 per cent). African Americans ranked third with 
approximately one-half as many characters (6.27 per cent). Asians (3.61 
per cent) and Native American Indians (3.61 per cent) comprised ver)' 
small percentages of the total characters examined in the three sets of 
textbook selections. Japanese, Chinese and Vietnamese characters rarely 
appeared in the seventh-grade texts, and not a single Pacific islander 
was identified in the reading instructional materials being considered 
for island-wide adoption by Guam educators. 

Row percentage scores reveal that of all the Hispanic characters 
found in the three programmes, more than one-half, 56.60 per cent, ap- 
peared in the McDougal, Littell series. The Houghton Mifllin programme 
had tlie least number of Hispanic characters (15.10 per cent). McDougal, 
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Table 1.1: Ethnicity of characters presented in three middle-school basal-reading series 



. Basal-reading Series 



Ethnicity 




Houghton Mifflin 






McDougal, Littell 




Silver Burdett and Ginn 




Total 




N 


Col. 


Row 


N 


Col. 


Row 


N 


Col. 


Row 


N 


Row 






% 


% 




% 


% 




% 


% 




% 


Total Caucasian* 


75 


72.12 


25.60 


133 


71.12 


45.39 


85 


68.55 


29.01 


293 


70.59 


Caucasian in USA 


36 


34.62 


21.30 


90 


48.13 


53.25 


43 


34.68 


25.44 


169 


40.72 


Caucasian out USA 


39 


37.50 


31.45 


43 


22.99 


34.68 


42 


33.87 


33.87 


124 


29.88 


African American 


12 


11.54 


46.15 


8 


4.28 


30.77 


6 


4.84 


23.08 


26 


6.27 


Hispanic 


8 


7.69 


15.10 


30 


16.04 


56.60 


15 


12.10 


28.30 


53 


12.77 


Native American Indian 


3 


2.88 


20.00 


11 


5.88 


73.33 


1 


.81 


6.67 


15 


3.61 


Chinese 


3 


2.88 


60.00 


0 


0.00 


0.00 


2 


1.61 


40.00 


5 


1.20 


Japanese 


0 


0.00 


0.00 


1 


.53 


16.67 


5 


4.03 


83.33 


6 


1.45 


Vietnamese 


0 


0.00 


0.00 


2 


1.08 


100.00 


0 


0.00 


0.00 


2 


.48 


Asian Other* * 


3 


2.88 


20.00 


2 


1.07 


13.33 


10 


8 06 


66.67 


15 


3.61 


Pacific Islander 


0 


0.00 


0.00 


0 


0.00 


0.00 


0 


0.00 


0.00 


0 


0.00 


Totals: N and Col. % 


104 


25.06 


187 


45.06 




124 


29.88 


415 





Notes: * Total Caucasian = Caucasian in USA + Caucasian out of USA. 

*• Thai (8); Mongolian (3); Indonesian (2); Iraqi (2). 
Source: Campbell. E.D. (1993) 
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Litiell had a relatively high representation of Native American Indians 
(73.33 per cent), as well, in comparison to the other two basal pro- 
grams, Houghton Mifflin (20.00 per cent) and Silver Burdett and Ginn 
(6.67 per cent). Of the African Americans identified in the basal selec- 
tions, 46.15 per cent appeared in the Houghton Mifflin reader; Silver 
Burdett and Ginn had the least number of African American characters 
(23.08 per cent). The Chinese were not represented at all in the 
McDougal. Littell program. Likewise Japanese characters never appeared 
in the Houghton Mifflin reader. McDougal, Littell was the only pub- 
lisher to include Vietname.se characters in the seventh-grade ba.sal. None 
of the three series examined in the Campbell .study portrayed a Pacific 
islander in a single reading .selection. 'Other' Asian characters identified 
tended to be concentrated in a single ba.sal selection and were pre- 
dominantly Thai. 

Table 1.2 pre.sents results of the Campbell .study related to the .sec- 
ond dimension of literacy, the inclusion of female and male characters 
in the ba.sals. Table 1.2 shows that McDougal, Littell had more .story 
characters (187) than the other two programmes. Silver Burdett and 
Ginn ( 124) and Houghton Mifflin ( 104). Thus, the number of characters 
for both .sexes in McDougal, Littell was greater than that of the other 
two reading .series. Yet, the McDougal, Littell reader also di.splayed the 
large.st di.screpancy of males ( 135 characters or "^2. 19 per cent) to females 
(fifty-two characters or 27.81 per cent). Overall, percentage scores 
showed that males appeared as story characters much more frequently 
((>9.64 per cent) than females (30.36 per cent) in the ba.sal .selections 
examined. 



Table 1.2: Sex of characters by basal-reading series 











Basal-reading Series 








Sex 


Houghton 


McDougal. 


Silver Burdett 




Total 






Miffltn 




Littell 


and Ginn 








N 


% 


N 


% 


N 


% 


N 


% 


Males 


71 


68 27 


135 


72.19 


83 


66 94 


289 


69.64 


Females 


33 


31 73 


52 


27.81 


41 


33.06 


126 


30.36 


Totals 


104 




187 




124 




415 





Source. Campbell. E.D. (1993) 



The Campbell study further a.s.ses.sed the treatment of the .sexes by 
categori'/.ing female and male characters by role: main character, sup- 
porting character or minor character (Table 1.3). Clolumn percentages 
reveal that of the 129 c haracters depicted in main roles, 105 (81.39 per 
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Table 1.3: Character roles by sex for the three basal-reading programs 



Sex 


Total 
N 




Main 




Character Roles 
Supporting 




Minor 








N 


Col. 
% 


Row 

% 


N 


Col. Row 
% % 


N 


Col. 
% 


Row 

% 


Males 
Females 


289 
126 


105 
24 


81.39 
. 18.61 


36.33 
19.05 


100 
58 


63.29 34.60 
36.71 46.03 


84 
44 


65.67 
34.38 


29.07 
34.92 


Totals 


415 


129 






158 




128 







Source Campbell, E.D. (1993) 



cent) were males while only twenty-four ( 18.61 per cent) were females. 
Even in supporting and minor character roles, males appeared twice as 
often as females. In supporting roles there were 100 male characters 
(63.29 per cent) in comparison to fifty-eight female characters (36.71 
per cent). In minor roles, eighty-four males (65.67 per cent) appeared 
in comparison to forty-four females C34.38 per cent). Row percentages 
indicate that across the three series, males appeared predominantly in 
main and supporting roles (36.33 per cent and 34.60 per cent, respec- 
tively) while females were cast most often in supporting and minor 
roles (46.03 per cent and 34.92 per cent, respectively). Females played 
the least important character roles in the McDougal, Littell basal pro- 
gram where they were identified predominantly in minor character roles. 
The final variable analysed in Campbell's study relating to the sex 
of the basal characters was the representation of females and males in 
the various stor>' types found in the seventh-grade readers: real-life, 
myths and legends, biographies and autobiographies and science fiction 
and fantasies. Findings as set our as Table l. i column percentages 
indicate that the greatest number of both male and female characters 
appeared in the real-life selections (44. (>4 per cent and 48.41 per cent, 
respectively). However, row percentages indicate that, overall, the real- 
life selections contained more than twice as many male characters (129 
or 67.89 per cent) as female characters (sixty-one or 32.11 per cent). 
The largest discrepancy between the sexes occurred in the myths and 
legends category' where males (sixty-five or 80. 2S per cent) were depicted 
four times more often than females (sixteen or 19.75 per cent). The best 
female representation was in the .science fiction and fantasy category 
where 39.02 per cent of the characters were females. However, even 
this strongest female representaticMi was not equivalent to that of the 
males, who appeared as characters in this story type 60.98 per cent of 
the time. 
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Table 1.4: Male and female representation in the four types of basal-reading selections 



Type of Selection 




Male 


Sex 


Female 








N 


Col. 


Row 


N 


Col. 


Row 


Total 






% 


% 




% 


% 




Real-life 


129 


44.64 


67.89 


61 


48.41 


32.11 


190 


Myths and legends 


65 


22.49 


80.25 


16 


12.70 


19.75 


81 


Biographies and autobiographies 


45 


15.57 


72.58 


17 


13.49 


27.42 


62 


Science fiction and fantasies 


50 


17.30 


60.98 


32 


25.40 


39.02 


82 


Totals 


289 






126 






415 



Source: Campbell, E.D (1993) 



Findings related to gains rele\'ant to the third dimension of literacy, 
varying classroom-reading instruction to improve learning, were positive. 
In the Foley, Farra and Chang (1991) study, significant gains in oral 
reading comprehension occurred following an eight-week implementa- 
tion of taped and paired repeated reading in place of three directed- 
reading lesson components considered to be ineffective with the 
Ouamanian students. Teacher observations revealed that the supple- 
mental reading strategies used with the experimental group in this study 
increased student interest, resulting in more effort being made during 
both the silent and oral readings of the basal selections. Students in the 
experimental group were described as more motivated and more on 
task during the taped and paired repeated readings than were the 
students in the control group who continued to receive and to avoid 
the three basal seatwork tasks included in the traditional instructional 
approach. 

The science teachers w ho encouraged their middle-school students 
to complete home textlx)ok-reading assignments by providing them 
with taped readings of the text assignments found this instmctional 
variation to be successful initially when students began to ask if tlie 
next tape was available e\'en l^efore the textbook reading was assigned. 
In addition, the number of audio tapes availal'>le for checkout had to Ix- 
increased to meet the growing student demand for the taped reading- 
homework component of the class. 

Results of the McDonald (1993) study that employed two in- 
structional variations to the traditional literature les.son, taped read- 
ak)ng and stor>' impressions in a taped read-along format, suggested 
that, while the high-school students responded favorably to all tliree 
strategies the first time they were used, fluctuations occurred in the 
students' interest in the approaches over time. lable l.S presents the 
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Table 1.5: Percentage of students who liked the reading strategy used (N = 89) 



Week 


Traditional Silent Reading 
% 


Reading Strategies 
Taped Read-along 
% 


Story Impressions 
% 


1 


61.54 


70.00 


73.33 


2 


25.00 


8.33 


15.38 


3 


38.46 


15.38 


40.00 


4 


14.29 


72.73 


42.86 


5 


23.08 


81.82 


25.00 


Average Rating 


32.47 


4965 


39.31 



Source: McDonald, F D. (1993) 



percentage of students in each of the three groups who liked the reading 
strategy used to teach the literature selections over a fi\'e-\veek period. 

A majority of the students in all three instructional groups initially 
rated the strategy used lo teach the first literature selection positively. 
However, ratings of all three strategies pluiiiiiieted by the second week 
of instruction. The traditional strategy received low-interest ratings for 
thv' remainder of the five-week study, 'Story impressions' ratings 
fluctuated but never regained the initial, positive response given after 
the first week of usage. The taped read-along strategy began to regain 
appeal follow ing two weeks of lowered ratings, receiving the highest 
percentage of positive marks from students during the last two weeks 
of instruction. The average rating of all three strategies indicated that a 
majority of the students were not favorable to any of the instructional 
approaches implemented in this paiticular study. 

To compare the comprehension of the three groups recei\'ing the 
same literature selections via different instructional approache,s, a one- 
factor analysis of variance ( ANOVA) was used. Mean scores of both the 
taped read-along group, treatment group two, and the story impres- 
sions in a taped read-along format group, treatment group three, were 
statistically significant at a probability lex'el of .0248 when compared to 
control group one who received the traditional instructional approach 
to literature, (See Tables 1.6 and 1.7), 

A post hoc analysis indicated that both instructional variations were 
significantly better than the traditional approach used in the classroom. 
However as can be seen from Table 1.8, a significant difference in the 
student-comprehension scores was not found to exist between the two 
treatment groups. 

The strongest finding in the O'Mallan, Foley and Lewis (1993) study 
was that the students, as a result of the guided reading-procedure in- 
structional x ariation, paraphrased more while simultaneously decreasing 
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Table 1.6: Mean comprehension scores and standard deviations for the three instructional 
strategies 



Group 


Mean 


S.D 


Control group 1 






Traditional strategy 


6.67 


1.67 


Treatment group 2 






Taped read along strategy 


7.76 


1.09 


Treatment group 3 






Story impressions strategy 


7.71 


1.90 


Source. McDonald. F.D. (1993) 

Table 1.7: Analysis of variance cf effectiveness 


of alternative reading strategies 





One Factor ANOVA . X. Group Y,: Comprehension Mean Score 



Source DP Sum Mean 

Squares Squares F-Test 



Between groups 4 25.90 6.47 2.95 (p = .0248*) 

Within groups 85 186.85 2.20 

Total 89 212.74 



♦ Significant 



Note: There were four treatment groups and one control group in the original study 
Source McDonald. F D. (1993) 

Table 1.8: Fisher PLSD post hoc comparison between groups 



One Factor ANOVA X,- Group Y,: Comprehension Mean Score 



Comparison Mean Difference Fisher PLSD 



Control 1 vs. Treatment 2 -1 09 1 07* 

Control 1 vs Treatment 3 -1 .04 98* 

Treatment 2 vs Treatment 3 - 05 1 07 



* p < .05 



Source. McDonald, F D (1993) 

tlioir tcndcncN' to plagiarize material read in their written summarie.s. 
And lastK'. following the cjuesiion-answer relationships nietacognilive 
instruction (Benito. Foley. Lewis and Frescoll. the experimental 

group in this study correctly answered a significantly larger percentage 
of the comprehension items on the end-of-chapier sodal-.studies test. 



Discussion 



The key finding of the Clampbell ( study was that approximately 
two-thirds to three-fourths of the characters depicted in the seventh- 
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grade basals were Caucasians, a minoril\' group in Guam's middle- 
school population. The ethnic groups found predominantly in Guam, 
especially the indigenous Chamarros and the neighbouring Pacific 
islanders, never even appeared in the basal texts being considered 
for island-wide adoption. The Campbell findings revealed a modest in- 
crease in the representation of Hispanic characters and a slight decrease 
in the number of .\iVican American characters appearing in the current 
middle-school basal texts. In the Campbell study of three se\'enth-grade 
basals. only 6.27 per cent of the characters were African Americans. 
Klebacher (1984) found that 13.00 per cent of the characters in four 
fourth to sixth-grade basal readers were .\lncan American. In Lc^gan 
and Garcia s ( 1982) study. Native American Indians comprised 3.-*0 per 
cent of the total basal characters. Siniilady. in the Campbell study con- 
ducted eleven years later. 3 61 per cent c^f the basal characters were 
native American Indians. Though new ethnic groups (Asians. Chinese. 
Japanese and X'ietnamese) emerged in the sex enth-grade basal readers 
of the 1990s, they appeared in a small number of selections. The fact that 
Pacific islanders and Asians were seldom found is notewortliN' consider- 
ing Pacific islanders and Asians have more than doubled in population, 
from 3.S million to ".3 million, according to the TvSA 1990 census. 

Relevant to the second dimension of literacy. Campbell fcnind that 
male.s were depicted in basal-reading selections at the seventh-grade 
level twt) to three times mc^re often than were females. Females are still 
seldom seen to be the problem solvers or stor\' focus in the instructional 
reading materials used in American classrooms, according to current 
findings in the Campbell study. Again, basal-publishing companies have 
ignored the findings and demands for change made by earlier researchers 
(Racism and Sexism Resource Center for F.ducators. 19 : Women on 
Words and Ullages. l^^^S). The large discrepancy in tlie ratio of females 
to males and the similar discrepancy in the frequency of females and 
males in main roles has continued into the I99()s. 

Relevant to the last dimensicMi. instructional variation, the novelty 
of the method of taped reading was effective (ox a lengthy period of 
lime at the elemenlaiy level during reading instruction and at the mid- 
dle-school level during science instruction. In the fifth-grade classroom 
(Tolev. Farra and Chang. 1991 ). gains were noted in comprehension for 
the group that used taped and paired repeated reading in place of 
instructional components of the basal felt to be ineffective with the 
Ckiamanian youngsters. In addition, student interest and willingness to 
stay on task during reading activites were also noted Ic^ increase as a 
result o\' the instructicMial change. At the middle-school level, students 
who previcHLsly had not completed science hcMiiework-reading tasks 
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were found to Ix^conie enthusiastic and positive when read-along tapes 
of the assigned texts were made available for overnight checkout. 

In literature classes at the high-school level, the taped read-along 
strategy was the most interesting to the students of the three approaches 
used to provide instruction of recjuired literature selections. Compre- 
hension of the literature selections increased for both alternative ap- 
proaches to the traditional classroom instruction, taped read-along and 
stor)' impressions in a taped read-along format. However, results of the 
study confirmed the suspicions of some Guam secondary' teachers that 
they have difficulty motivating liigh-school students to read assigned 
texts. While the students in the study initially responded favorably to all 
the strategies used, fluctuations occurred in their interest in the in- 
structional approaches throughout the fixe-week span of the study. 

The *possil")le sentences* strategy used in place of the vocabulary 
component of the directed reading lesson resulted in students' improved 
comprehension of the key vocabulary terms targeted for instruction 
(Bisnionte. Foley and Petty, in press). In addition to the students* in- 
creased abilities to pronounce and define the x'ocabular)' terms, obser- 
vations of the researchers and classroom teachers were that students in 
the experimental group were more interested during classroom instruc- 
tion and more likely to complete homework-reading assignments than 
were those in the control group. In this particular study, the traditional 
approach, if conducted as directed by the teachers manual, was also 
effective. 

As a result of the guided-reading procedure instructional variation, 
fifih-graders paraphrased more while simultaneously decreased their 
tendency to plagiarize material in written summaries (O Mallan, Foley 
and I.ewis, IW,-^). This instructional result should greatly benefit particip- 
ants in the study during their secondar\'-school years, when they will 
ix' reciuired to write numerous reports widiout plagiarizing texts. 

In the final study in which (|uestion-answer relationships and meta- 
cognition were employed to strengthen student comprehension of social- 
studies text ( Benito, Foley, Lewis and Prescott, IW.^), intermediate-aged 
pupils who averaged only one correct answer to the four items at the 
end-of-the-section comprehension chetk, doubled their performance 
on this measure following a five-week instructional period. 

In the past five years, in all cla.ssrooms in which teachers imple- 
mented variations in reading or content-area instruction, students have 
experienced comprehension gains. Children in Ciuam s elementaiy and 
secondary classrooms responded positively to a number of diverse 
instructional strategies which deviated from the traditional approach to 
reading. 
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Mess^es for Teachers 

Children spend an important part of their Hves in school. They learn a 
range of basic skills including those of literacy. They also formulate 
social attitudes and behaviours. Self-esteem, academic success and 
altitudes toward ethnic groups are influenced by what students do 
not read as well as by what they do read. Subtle and overt values are 
transmitted through the contents of reading texts (Jackson, 1944; Fisher, 
1965; Tauran, 1967; Litcher and Johnson, 1969). 

For educators in Guam, the results of Campbell's study on ethnic 
and sex representation of characters in current middle-school basal 
readers suggest that mandated reading materials may make no effort to 
portray characters who are similar to the population of students actu- 
ally using them. As a result, problems in areas such as comprehension 
and assessment could be expected — due to the mismatch between 
.student and character experiential backgrounds. In the affective domain, 
an alarming message is being sent to the children concerning the value 
of their cultures. Will Guam's young people be able to acquire a love 
of reading when exposed, for the most part, to story characters unlike 
them.selves? 

Internationally, educators need to continue to examine closely the 
instRictional reading materials being used in their classrooms to deter- 
mine if they promote or inhibit learning and literacy. Cultural diversity 
in Guam, or in any democratic society in which equality of opportunity 
is acknowledged as a crucial objecti\'e, has important implications for 
educational policies and practice. As the findings from the small-scale 
researches in Guam reported above suggest, important lessons can be 
learned by teachers from the practices and research of colleagues in 
other culturally diverse societies. For example, cultural diversity and its 
pedagogic implications across an entire curriculum, and in relation to 
literacy in particular, are being systematically addressed at national, 
.school and classroom levels in England and Wales (Fumfrey and Verma. 
1993a; 199.^b; Verma and Pumfrey. 199.^a; 1994). Are the materials used 
in your school's literacy program culturalh* sensitive? 

Research must extend further with the next cycle of ethnicity studies 
to include a new and ciuickly growing 'ethnic' population of interracial 
children. Interracial children form a growing group in today's classrooms; 
yet, this student population has seldom been included in ethnic-re.search 
endeavours. 

Society is changing and the portrayal of the sexes must also change 
in the school curriculum. Literacy, as defined by academic achiese- 
tnent, is curtailed for females by the unfair depiction of them in the 
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texts that they read. Stories that children are required to read must 
increase the number of females depicted in strong leadership roles. 
Today's females need achievement-oriented role models with whom 
they can identify. Males also must be presented in varied roles, free of 
the sexist labelling that occurs when they are cast in stereotyped roles. 
If the definition of literacy does expand to include fostering a love for 
reading and producing avid readers', then the need for immediate and 
drastic changes in the classroom materials used to teach reading will 
have to be addressed. 

Students* self-esteems, values, aspirations and fears are influenced 
by the extent to which they identify with the characters and life situ- 
ations experienced through print — especially if they become emo- 
tionally involved with the characters. Character identification leads 
students to mold their own behaviours after their model characters 
(Busch. 1972). If minority and female students cannot find characters 
and settings in stories uith whom, they can identify, their personality 
growth may become inhibited. They can also become disenchanted 
with school and ultiniately become academic non-achievers (Poussaint. 
1970). Reading materials used in schools worldwide should reflect the 
population of the local comnumity and of a world in which judgment 
is not based on sex or ethnicity, but on an individual's character. 

Educators, parents and children alike agree that learning to read is 
not an easy task for many pupils. Despite this, it is the primaiy facet of 
schooling where failure is unacceptable. In striving for world literacy, 
the definition of the concept must expand as the knowledge base and 
commitment to the cause grow. Hstablishing standards and then de- 
termining how to deliver instruction and measure success is only the 
beginning of the international quest for ecjuality and an educated 
citizenr>'. All pupils who enjoy instruction will enhance their learning 
potential and, once truly motivated, will choose to use the skills learned. 
Avid reading and writing will stimulate further growth in literacy skills. 

In establishing sound literacy goals for the 1990s, the foremo.st 
goal must be to instill in children a love for reading so that, ultimately, 
they will read. Hnthusiastic. lifelong learners need to be graduating 
from secondary institutions worldwide. Educators who recognize the 
value of literary' activities and make them enjoyable will have a much 
greater success rale than those who continue to view reading solely as 
a set of skills to be taught and mastered or a curriculum to be completed. 
The reality and commonality which bring international educators together 
is the fact that there can be found, in all nations, a group of unfortunate 
individuals who are either unable or unwilling to read and write. 
Strengthening the literacy levels of these students is perhaps tlie iiu)st 
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demanding challenge faced in today's classrooms. This might he hest 
accomplished by experimenting with instaictional variations which could 
improve both student interest and comprehension during reading and 
content-area mstruction. 



Conclusion 

Expanding the dimensions of world literacy to make it work better for 
more learners will best be facilitated by singling out and addressing 
facets which may be factors contributing to students' reading success or 
reading failure. The importance of making reading instaiction relevant 
and enjoyable must at all times remain at the centre of the world- 
literacy movement as its dimensions expand and more pieces of the 
literacy puzzle are examined in an effort to better understand not only 
how children learn to read, but why children choose to read and what 
makes children k)ve to read. 
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Children's Learning and the 
New English Curriculum 



B, Rahan-Bishy 



Summary 

This chapter reviews the main features of the reading conipon- 
enl of the new Fnghsh Curriculum and judges how far these 
will influence teaching and children learning to read. The na- 
ture of exidence on how children learn to read is explored. It 
is argued that research paradigms too often determine the re- 
search questions about reading rather than the reverse which 
should he the case. This situation causes problems in the Fnglish 
Curriculum when coupled with \'ested interests pulling in dif- 
ferent directions. Much work remains to be done to hold the 
giound between the different tensions and a\'oid a wilful dis- 
regard for theories of learning and their reality in the classroom. 



Introduction 

This chapter explores issues concerning children's learning within the 
context of their learning to read and then maps these on to the concept 
of learning which is becoming enshrined in the National Curriculum 
in Fngland through teacher training and becoming embodied in the 
rex ised draft Order for Knglish in particular. 

The contributions of Piaget. Cliomsky and Bruner to our thinking 
concerning how children learn to read have been ably demonstrated by 
Donaldson and Reid (1982). For instance. Piaget's notions (19SS) of 
'accommodation' and 'assimilation' pixn ide a powerful model for coun- 
teracting notions of learning being the stringing of beads. Through 
using these two concepts we are able to get a more sophisticated sense 
of new knowledge transforming old knowledge structures, and a more 
organic grasp of the notion of de\elopment. 
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Chomsky's work (1957) has given us insight into the way children 
acquire language and use their knowledge about language to become 
increasingly efficient in their ability to identify linguistic probabilities 
from hmited information. In addition to these powerful strategies, 
Bruners research (1957) has highlighted the concept of 'perceptual 
recklessness' and reminds us to temper ideas of predictability in lan- 
guage with due regard to the perceptual features of the spoken or 
written stimulus. 

Focus on Learning 

Concepts of learning and causes of failure to learn are central to the 
concerns of reading teachers, and what they know from their own 
work with children above all else is that we all fail to learn what doesn l 
make sense to us. The problem, as Smith (1986) has pointed out, is not 
one of not being able to learn, but rather not being able to make sense 
of what we are trying to learn. 

This belief comes from the experience that teachers are more 
effective when they make themselves understandable to the learner, 
no matter how little the learner knows. Because of this insight, interest 
has been focused on what has become known as 'prior knowledge . 
This view of necessar>- prior knowledge, is based on the fact that it is 
easier to understand what people sa>* or write when there is an existing 
understanding of what they are talking or writing about. 

Work in the 1970s, for instance, illustrated that children categor- 
ized as iow achievers' in school have limitations in their prior know- 
ledge rather than any defective learning abilities (Anderson, Spiro and 
Montague, 1977). Where does this prior knowledge come from? It is not 
from s)'stematic and specific instruction because that in turn recjuires 
prior knowledge. It comes from more general experience of the topic 
in which the learning is to take place. We learn by engaging in activ- 
ities, where we are helped to understand what is going on, and helped 
to engage in the activities themseh es. 

Smith (op. cit.), refers to experimental e\'idence (Mandler. 19()7) 
which illu.strates this process, demonstrating how understanding, or the 
activity of making sense, leads to learning rather than the deliberate 
effort to, for instance, memori/e. The activity which was investigated 
included three groups of subjects who were gi\'cn cards with single 
words printed on them. One group was asked to memorize the words 
for a later test, one group was asked to categorize the words in what- 
ever way they cho.se and to memorize the words for a later test, the 
third group were asked only to categorize the words. 
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Both groups wlio engaged in the categorization task did equally 
well in memorizing the words and did considerably better than the 
group asked to memorize the words alone. The activity of categorizing 
the words was a powerful means of imposing order and making sense 
of the activity. Indeed, in this investigation categorization was found to 
be far more salient in promoting learning than the nonseasical task of 
learning individual words. 

Learning has always been a difficult topic for psychologists to 
research. How can experiments compare between subjects who will 
inevitably have different prior knowledge? Towards the end of the last 
century, experimental psychologists discovered that they could avoid 
all this difficulty if they restricted their studies to how individuals learned 
nonsense or meaningless syllables and words. No one can have prior 
knowledge of something from outside their experience, something that 
makes no sense. With nonsense, as Smith (op. cit.). has pointed out. 
ever>^one starts learning from scratch. There were no uncontrolled vari- 
ables or indi\'idual differences to 'contaminate' the results. Experiments 
could now be replicated and experimental results verified. The non- 
sense syllable offered the researcher control. 

The results of such research studies made possible the formulation 
of laws of learning, which could be repre.sented by smooth curx^es on 
a graph and summarized by algebraic formulae. Indeed, the history of 
the experimental study of learning has been a struggle between re- 
.searchers trying to devise better nonsense and subjects tiying to make 
belter sense of it. However, the fact that some laws of learning could 
be said to be based on nonsense has been generally ignored in the field 
of education during the first half of this centuiy, although its influence 
can be found even now. 

There are two common assumptions underlying this view of learn- 
ing: on the one hand, to understand something you first must learn it, 
and on the other hand, that there is no point in studying something you 
already understand. Hoth these a.ssuniptions are wrong. Indeed the 
reverse is true. To understand something does not mean that we know 
it already, but that we can relate it to what we know already, that we 
can ffiake sepise of it. In order to learn you ha\'e first to imderstand. 
These erroneous assumptions confuse understanding with knowledge. 
We are capable of understanding things we do not already know and 
we probably do so ever)' moment we are awake and not bored or 
bewildered. When we read something we imderstand, we make sense 
of it even though we did not know it already; and as a consecjuence 
we learn. The only thing we are likely to learn if we are confused or 
bored is that the text is confusing or boring. Smith (op. cit) continues, 

.a 
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if children read material that is confusing and boring, they quickly learn 
that reading itself is confusing and boring and subsequently make little 
progress with literacy. 

Developiiientally, a growing understanding of the nature and uses 
of print does not occur in a vacuum. It depends on living in an environ- 
ment where print is important. It depends on interactions with print 
which are a source of social and intellectual pleasure for both children 
and those around them. Indeed, researchers have pointed out that 
awareness of forms and functions and uses of print provides not jusi 
the motivation for learning to read, but the backdrop against which 
reading and writing are best learned. The likelihood that children 
succeed in learning to read at school, therefore, depends most of all on 
how much they have already experienced about reading before they 
gel there (Raban, 1984) and this is where children's prior knowledge 
comes from. This finding is also reflected in research findings from 
other countries (Clay, 1979; Ferreiro and Teborosky, 1983). 

Styles of Research 

What we know about children learning to read is based on evidence. 
The nature of this evidence emerges from a variety of different sources 
and gives practitioners a number of mixed messages as they look to 
research to inform their classroom actions, in particular, as they look 
for ways of helping children who are making slow progress w ilh their 
reading. 

An important source of this information comes from academic 
research, although this has not always been helpful. For instance, while 
allending a conference in America a few years ago, 1 was reminded that 
people, even those who share the ^mie concerns, do not share the 
same views of the world or the way in which it is organized. What 
counts as evidence for one group is unconvincing for others; what is 
important for another group is seen as insignificant when viewed from 
a different perspective. 

At this conference findings were reported from a large-.scale study, 
a micro-analysis of classroom reading. In this research based on chil- 
dren aged 7-9 years, video recordings were being analysed in order to 
define, describe and seek the role of attentive behaviour in reading. It 
was hypothesized that increased attention on the part of the child would 
decrease oral-reading errors, and this decrease in errors would in turn 
lead to a belter understanding of what was read. 

These videos provided data which were being analysed in minute 
detail. The outcomes were being subjected to statistical analysis derived 
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from models developed in the sphere of biomedical quantitative 
research. The time, effort, money and sheer ingenuity in\'ol\'ed in this 
major research initiative was breath-taking and was beginning to yield 
tentative results: 

• bo\^s are more inattenti\e than girls: 

• . younger children are more likely to be inattenti\'e; 

• difficult texts increase inattention: and 

• low-ability children tend to lose concentration. 

Practitioners can be forgiven for asking if research of this complexity 
and sophistication really is necessary* to find out things which are all 
(juite clear to any teacher or parent. Here we see the beginnings of a 
dilemma, identified in this instance by what this researcher would count 
as evidence and the nature of that e\'idence. 

What is at stake here are the questions: What is knowledge? — 
and, more importantly — Whose knowledge counts? If research, through 
its activity of systematic investigation has any role in contributing to our 
knowledge base, what we are finding in reading research is an uncom- 
promising emphasis on a single set of assumptions about the world, the 
way we experience it and how we learn from this experience, and this 
in turn informs our research literature. 

As someone with the responsibility for publishing an annual re- 
view of reading research in Britain. I have the opportunity for tracking 
pre.stigious journal articles and the outcome of research funding pol- 
icies. In this countr\' and elsewhere, studies which get published pre- 
dominantly emanate from a single research paradigm. This paradigm 
rests in the empirical/analytic tradition and is based on assumptions 
taken directly from the physical and biok)gical sciences, assumptions 
which, for instance, have given us theories of learning nonsense. 

Examples of funded research embedded in this paradigm include 
the Economic and Social vScience Research Council (EvSRC) study which 
reports that when text is manipulated through misspellings and stimu- 
lus degradation, partially sighted subjects slow down their cn erall read- 
ing speed. Also the Medical Research Council (MRC) .study which reports 
that partially hearing subjects are poor at judging rhyme. 

The kind of per\'asi\'cness enjoyed by this style of research can be 
illustrated by the percentage of papers published in the top interna- 
tional journals. For instance, in a sur\'ey of twenty-one Nolumes of 
Reading Research Quaiierly 97 per cent of the articles are from this 
perspective. 99 percent in Joitnial of Reading BeharioK and 98 per 
cent in foiniial of Reading Rese({rcb, Hven in those journals which do 
not attract such prestige, for instance. Research in the leaching o/Biglish 
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and Language the percentages are "2 and 77 respectively (Shan- 
non. 1989). 

Taken together, the assumptions of the empirical anahtic tradition 
enable reading researchers to treat all questions as if they had a single 
answer to what is . . . ?": 

• What is reading? 

• What is the most effective way of teaching reading? 
What is learning? 



Of course these questions are too complex to address directly in single 
studies, so questions like these get subdix ided into components. For 
instance reading becomes attention, decoding, comprehension and then 
each of these subsystems is dix ided exen further in the belief that the 
bits can be put back together again to address the major issues. 

In an imponant sense, these researchers are unconcerned about 
the effects of their work on the total cultural and social framework of 
reading instruction. Their purpose is to investigate cognitive processes 
and develop rational systems for testing and validating their own hy- 
potheses. Research paradigms, whilst providing opponunities to bring 
a particular order to the world, also set constraints on what researchers 
can .see in that world, on the que.stions they consider legitimate and 
important, and on the technicjues they choose to answer those cjues- 
tions (See Chapter 1). 

Wilding (1988) has conducted extensive work on experiments 
relating to lexical decision tasks. In these tasks, subjects are recjuired to 
reject or accept letter .strings as words or non-words. He finds that this 
type of task, whidi appears to be currently central to psy chological 
investigations of the reading process, needs modification if sense is to 
be made of the results. However, he acknowledges; 

it may be more justifiably argued that such complications move 
the investigation away from the original concerns of the 
research . . . and that the task will become even more unlike 
normar reading. (Wilding. 1988) 

Wilding also hints that lexical-decision task experiments may only 
have marginal relevance in the investigation of reading anyway. In a 
recent volume of Rcadi^i^ Research Quaiterlw Anderson. Wilkinson 
and Mason ( 1990) have identified the problem in all of this. They state 
that reading research and teaching reading have been moving 'out of 
earshot' of each other. They continue by .saying that research on read- 
ing reflects little of what has been discovered about reading. 

46 
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What is being argued here is not thai the dominant paradigm is 
necessarily wrong, but that this paradigm, because of elitist forces, is 
detemiining the research questions which it is legitimate to ask. The 
reverse, of course, should be the case. Research questions themselves 
should be determining the choice of paradigm and paradigmatic diver- 
sity should be the more common characteristic of academic reading 
research. Questions of 'What is . . . ?* clearly need to give way to ques- 
tions of How do . . . 

How do children learn? 

How do children learn to read? 

An example of what is referred to here as paradigmatic dix ersity is 
K) be found in the recently' reported research from two teams at Har\'ard 
University headed by Jeanne Chall (Chall et ai, 1990) and Catherine 
Snow (Snow ct ai. 1991) respectively. Both teams of researchers fo- 
cused on the same sample of children, although they conducted their 
research in quite different ways according to their own paradigmatic 
preferences. While their findings are reported in separate books, the 
coincidence of their results is remarkable and when the books are 
studied together they provide a richness of confirmatory insight rare 
among academics. 

What is clear to those of us working in the field of literacy, is that 
practice in certain respects has outstripped theory and experimental 
evidence, and there is a loss of faith in how the two communities of 
research and practice can relate and articulate their concerns. This has 
resulted in the unfortunate circumstances of the moment which sees 
reading researchers unable to combat the force of ill-informed concern 
about the teaching of reading in our school. Indeed we are being pushed 
to one side by a tyranny of ignorance and its unabashed confidence. 

In an important sense we are unable to join the fray — our training 
has prepared us well for the university seminar and the conference 
symposium, but leaves us (|uite unprepared for the discourse of media 
which has neither the time nor the- patience for considered debate and 
reviewing all factors. Such discourse as ours does not translate easily 
into banner headlines nor does it fit well into two minute radio and T\' 
slots. Tho.se who have these skills do make their voices heard, although 
they are not always the voices we would wish to hear. 

Learning to Read 

Brock (1900) has identified several of these voices, which he refers to 
as myths. One powerful and enduring voice continues to call out that: 

M 
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reading is accurate ciecocliHg. This is what purpons to he the 'common 
sense' view of reading. If text is made up of words and words are made 
up of letters, then start with the letters. Such a view has a long heritage. 
An instance of it can be found more recenth' in 1970 when the National 
Right to Read programme was launched in America, It was launched 
with all the euphoria of getting a person to the moon and back, and the 
same principles were to be adopted to teach reading. 

As Smith (op, cit,). has pointed out, getting to the moon and hack 
Vv'as rightly seen as a complex logistical problem. It was seen as a 
problem which was soh ed one step at a time, by breaking down the 
task, through systems analysis and linear programming, into manage- 
able units, with each object i\'e rigorously assessed for achie\'emenl. 
and with quality control monitored, the final objectiN'e could be reached. 
It was assumed that identical procedures would deliver literacy to all 
children, by teaching one small step at a time and constantly testing 
that each objectiw was successfully achiexed. The intellectual and 
publishing muscle of America was called on to help reach this ambi- 
tious target. 

Theories were deNeloped to support and justify this new frag- 
mented approach to education. In time the idea of suhskills became 
raised to even greater heights and referred to as the basics' along with 
emotional assertions that education should return to them. It is a famil- 
iar story in this country- too. The underlying argument remains the 
.same: the pathway to successful achievement is a trail of meaningless 
fragments, and this \'iew has been with us for a long time. 

In the seventeenth centur\': 

the ordinary- way to teach children to read is. after they ha\'e got 
.some knowledge of their letters and a smattering of syllables 
and words, to make them name the letters, and spell the words 
till by often use they pronounce at least the shortest words at 
first sight, (Hoole, 16(){)) 

In contrast, by 189.^ Rice, a noted American radical who made an 
extensive sur\ ey of .school practice including inter\'iews with teachers, 
argues for the use of a variety of teaching methods in teaching reading. 
And in the fifteenth edition of the 7cachers' Matiual of the Science a)J(l 
Art (>J 7eachi>ig dated 1898 teachers are urged to consider good read- 
ing as good speaking; 

children cannot read what they do not understand, they cannot 
in such a case interpret the thought of the writer. (National 
Society's Depositor}'. 1898) 
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Under the section in this book which addresses methods of teaching 
reading, it continues; 

The old fashioned way of teaching words (was) by beginning 
with letters and combining these letters into syllables and words. 
This method does not meet the difficulty that every 20th word 
in the English language presents. You will find that in practice 
you will have to combine methods, (op, citJ 

This was suggested 100 years ago and probably not for the first time 
even then. 



Messages for Teachers 

Children learn language, literacy and other things holistically; the>' do 
not learn in fragments of isolated skills. Language, whether spoken or 
written is a complex set of subsystems which interrelate. And thev 
interrelate with increasing sophistication as development proceeds, 
Succes.sful language learners begin with a reason foi* using language, 
and move gradually towards acquiring the forms which re\'eal those 
functions. As has been pointed out. children learn language; they learn 
to use language and go on to learn about language (Halliday. 1973). 

Clay has been reminding us for more than twent>' years of one of 
the major outcomes of her extensix e observational studies of children 
learning to read and write. 

Children do not learn about language on any one le\'el of or- 
ganisation before \\\i^y manipulate units at higher levels, although 
many leaching schemes believe that this is so. (Clay, 197S. p. 19) 

Clearly, boih skills of accurate decoding and comprehending messages 
in text are necessary-. But while they are nece,ssar>' conditions of literacy. 
lhe\' are not sufficient conditions fully to describe literacy. Literacy 
also involves the ability to respond critically, sen.sitively and \\ ith dis- 
crimination to what is read, heard and seen. Promoting approaches 
to uncritical literacy, which are encapsulated in the rigid teaching of a 
sight \*ocaI:>ulan' or of phonics or the exclusi\'e use of a single reading 
scheme can only again lead to a lowering of standards. One wonders 
whether it is one of the current political goals of education to produce 
critically literate people? 

A now well-known DMS memo, circulating the ministiy at some 
time during the early IWOs said in pan: 

.^9 
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we are beginning to create aspirations which society cannot 
match . . , When young people drop off the education produc- 
tion hne and cannot find work at all. or work which meets their 
abilities and expectations, then we are only creating frustration 
and perhaps disturbing social cons^'quences, (DFS cited by 
Simon, 198S) 

The memo continues: 

if we have a highly educated and idle population we may 
possibly anticipate more serious social conflict. People must be 
educated once more to know their place, (op, cit,) 

C'.ontrast this position with that currently predominating in America, 
Six goals have been set for education in the 1990s by the President. The 
target date is the year 2000. Three of them are worth mentioning in this 
context; 

\, All children in America will start school ready to learn. 
(Implying, of course, that learning only takes place in 
school?) 

*. rS students will be first in the world in science and math- 
ematics, 

S. Hver\' American adult will be literate . . . 

These directiv es have been gi\'en against a backdrop of the report from 
the National A,ssessment of educational Progress which indicates that 
rS students achieved at roughly the same level in reading as they did 
twenty years ago — indeed much the same stor\' as in this countiy. 
However, the I'S ston' is told against a background of phonics teaching 
and ours against a position which is hcKl to be the reverse. 

Brock (op. cit.) identifies further voices which claim that: there is 
sonictbifi^^ allied J){}icti(»i(il literacy irhicb am he measured by stand- 
ardized tests. 

In 1992 when the International Reading Association honoured the 
Director Cieneral of TXHSCO. because woddwide illiteracy rates had 
dropped, it is worth reminding ourselves what functional literacy might 
mean. What is ovedooked in the eagerness for worldwide literacy, is 
that there is not just a single commotiity called 'functional literacy*. 
What is functional literacy for a (*ar mechanic who consults complex 
technical manuals of engine specifications, may not be the functional 
literacy of a parent with a medic ine bottle or an adult with a job speci- 
fication and an application form. I'or instance, in Hrice-Heatirs study 
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(1983) of two communities in the Carolinas, she illustrates how telling 
stories in one community is a form of cultural sharing and in the other, 
stories are considered improper. In this sense, what does it mean to talk 
about rates of illiteracy, or rates of functional literacy in a national 
population? I, for instance, as a university professor, might be consid- 
ered by most to be a literate person, but if my reading contexts are 
changed from educational research, current affairs, literary texts and so 
on, to income-tax returns, instructions on how to assemble a wardrobe, 
or computer manuals, even more recently, to lengthy documents from 
Senate House, then I am in deep trouble! 

What we see happening here is that literacy needs to be defined 
by context, opportunity, and by experience and prior knowledge. To 
narrow down the complexity of literacy to arbitrarily chosen criteria of 
word recognition and the like is inappropriate. It pays little attention to 
historical changes and significantly different contextual circumstances 
that coexist within diverse communities. It also ignores the differing 
degrees of litenicy competency operating within any one individual. To 
follow the road of arbitrarily chosen criteria will result in dropping 
standards not raising them, as we have seen in the current testing 
regime currently being put in place in England and Wales. It is worth 
remembering that the more obser\*able an aspect of language behav- 
iour, the easier it is to assess and the more likely it is to be trivial. 

Conclusion 

The issues raised in this chapter are enduring threads which ha\'e woven 
their way through our National Curriculum Council English-evaluation 
project ( Raban et al., 1993) ^nd repeatedly I haw been brought up against 
these tensions. Firstly, in a review of theories of learning, theories of 
teaching and theories of curriculum content: how far should these come 
together in practice or should be located apart from each other in some 
form of "fitness of purpose? In the English curriculum there are vested 
interests pulling in both directions. 

Secondly, views of language which deal developmentally with the 
smallest units of obserx able behaviour first, and then build the curric- 
ulum through increasing levels of complexity, act in opposition to the 
ways in which children use language and the way they learn language, 
whether spoken or writu*n. However, starting a curriculum pathway l^y 
acknowled-,ing that chikiren operate on all levels of language at once 
from the beginning, leaves us with a developmental and assessment 
framework which merely states that they get better as they make progress 
through school. 
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These issues are causing problems as the English Order is revised 
and redrafted because, since the beginnings of mass public education 
in Britain, the teaching of English as a whole has been a focus of keen 
political interest and political control. The definition of what is to count 
as English (speaking and listening, reading and writing) has always 
been a matter of conflict between contending interests. The \ iews people 
have of what literacy involves, of what counts as being literate, of how 
we learn and teach literacy, what they see as 'real' or appropriate u.ses 
of literacy skills determine, both directly and indirectly, the realities of 
practice. 

As dissatisfaction intensifies with regard to the products of our 
maintained schools by commerce and industr}% in the public mind a 
return to the teaching of standard English and formal grammar along 
nineieenth-centur>' lines has come to be equated with a rise in stand- 
ards, not only of literacy, but of morals and beha\'iour in society as a 
whole. This is the teaching which is perceived to characteri/^e English 
leaching in our pri\'ate schools and our best grammar schools of the 
past. As one commentator has commented in Ihe Sunday Times-. 

Grammar is the fastest rising topic in the Tor\' party firmament, 
now almost on a par with hanging and dole fraud . . . The na- 
tion's grammar stirs the juices. ( Wc SiuhJay Times. November 
1988) 

There remains much work to be done to hold the groimd between 
the.se tensions. This is especially so as we see teacher training reduced 
to a recjuirement for subject knowledge coupled with classroom- 
management skills ak)ngside a wilful disregard for theories of learning 
and their reality in the classroom. With respect to the English curric- 
ulum, the recursive model of language development found in the current 
curriculum, rooted in research and matched by the experience of teach- 
ers, is eliminated by the recent proposals for a re\'ised curriculum. In 
siiort. the new assessment statements, for instance, are even more ar- 
bitrary* and remote from empirical evidence. The model of language 
and learning implicit in the proposed curriculum is that of deveU>ping 
language skills through conscious mastery of rules and much guided 
practice. 

The long-term con.secjuences if these proposals are implemented 
will be a narrowing of the curriculum with a heavy empha.sis on cor- 
rectness a'nove all else. The benefits which teachers iia\-e gained from 
a dexeiopmental approach will be lost as pressure is experienced to 
leach the con\entioiis of speaking, listening, reading and writing to 
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children who may not have enough experience to make these mean- 
ingful. The proposed National Curriculum for English, especially that 
part of it to be assessed, would if it were to be implemented, be one 
that ignored most of what has been discoN'ered about the development 
and teaching of oral and written language over the last thirty years. It 
is based on an unreal vision of what children are like and how best 
they can be taught. The view of learning which it puts forward is one 
which assumes that learners are passive, compliant and of similar back- 
ground experiences, interests and abilities. This is simply not true. To 
suggest that children come into classrooms as empty vessels is an insult 
to their different types of experti,se and to their validly different expe- 
riences of the world. 

If we want our children to understand the complexities of our 
societies and to engage in constructive dialogue about the wodd in 
which they li\'e. we cannot ask them to leave their language, their 
experience and their culture on the doorsteps of our schools. More 
importantly, if we continue to allow ignorant pronouncements about 
the Hnglish curriculum to go unchallenged, we are in danger of produc- 
ing a generation of young people whose own language has loeen de- 
valued, whose self-esteem has been damaged and whose learning has 
been impeded. 

If we as teachers attempt to funnel the richness and diversity of 
our classrooms into an inflexible, monocultural curriculum framed in a 
rigid adherence to. for instance, standard Hnglish for all purposes or 
phonics as the only method of teaching reading, then we deny what we 
know about the way people learn and we shall, at best, marginalize 
and, at worst, destroy the voices of our pupils. Their voices de,ser\'e our 
attention. 

Canmell ( 199,5). during an investigation of Hnglish in institutions of 
higher education, asked how these departments saw the future of Hnglish 
studies. Their collectix e response was characterized by diversity. The>' 
saw new options coming on stream concerning cultural and media 
.studies. They saw the widening of the canon of literary authors and 
lexis to include more women and ethnic-minority writers. They per- 
ceived the study of Hnglish literature becoming the study of literatures 
in Hngli.sh. They .saw Hnglish as an increasingly interdi.sciplinary study, 
with a fuller integration of language and literature studies within a view 
of literary^ studies which would be sufficiently inclusiw to embra» e 
ixnli canonical and non-canonical texts. 

Staff in these university departments of Hnglish agreed that recent 
government proposals to proscribe what happens in schools* Hnglish 
curricula is a thunderstorm on their horizon. They were vociferous in 
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wanting to combat this whenever it appeared. To this end, forty pro- 
fessors and lecturers of English signed a letter which appeared in the 
British press during 1993 strongly objecting to the proposed revisions 
to the school English curriculum. The proposals, they maintained; 

reject the consensus of professional opinion about the best 
practice in schools; they involve an unacceptable degree of 
political and statutory control over classroom teaching , , , a new 
curriculum which so misrepresents the subject of English is an 
inadequate preparation for its study at higher levels. 

Finally, at university level there are exciting new developments 
and trends in the study of English leading to cultural and media studies, 
women's .studies, creative writing and the study of other literatures in 
English. What we .see happening here is tliat if the revi.sed proposals 
are implemented then the .study of English in schools will slowly but 
definitely be pushed out of line from the teaching of English at higher 
levels. There is a danger that such a disjimction will fracture our intel- 
lectual heritage, destroy homogeneity within the profe.ssion and .stran- 
gle the freshness of discourse v\ hich characterizes the vitality and richne.ss 
of English teaching. 
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Summary 

Approaches generalK' applied in the development of literacy 
locally may be broadly described as skill-based. The underlying 
assumptions are taken as 'givens'. Methods of instruction, con- 
sistent with these approaches, achieve standards (determined 
through a primarily punitive approach to assessment) which 
cause much public concern. Data from obser\ation and teacher 
information o\'er two terms of instruction, provide the descrip- 
tion of the status of the learners experiences and knowledge 
especially with respect to language. The discussion brings into 
focus the function of support for. and validation of, the learner s 
dilemma in the effort after sense, and examines the concept of 
universals in literacy instruction and the relevance of the notion 
of context as dynamic. 



Introduction 

Some st)cieties are experiencing a growing anxiety over standards of 
literacy. I'his anxiety luis often been fed by the perception that standards 
are falling, and decision-makers in many instances have responded by 
embarking on national schemes targeting a rise in literacy levels. Those 
of us in countries categorized as 'Third World* have had our anxiety 
iransfonned into qualification for financial assistance from international 
agencies. Professionals in education ha\'e been expending much re.search 
energy, time and resources on literacy. Professionals in many other 
disciplines have, separately or in collaboration with their colleagues in 
education, brought the phenomenon of literacy into sharper focus. 
CAirrenlly, there is a heightened interest in how literacy is accjuired — 
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an interest which has uncovered many issues. These issues enter the 
debate on the extent to which methods of instruction are responsible 
for standards in literacy. Some argue that in fomial schooling at least, 
methods need to be informed by a somewhat more complete and 
coherent understanding of how children acquire literacy. 

This chapter takes the view that learning to be literate is one in- 
stance of learning and operates on the basic principles of all learning. 
These principles define the learning environment: and individuals ac- 
quire patterns of operating these principles. Violations and disruptions 
of the principles and acquired patterns introduce dilemmas for a learner 
Mismanagement of violations and disRiptions introduce discontinuities 
into the process of learning and affect the nature of learning. Natural 
(versus 'contrived ) learning situations allow potential violations and 
disruptions to be managed as sources of information in operating the. 
principles and building patterns. Contrived learning situations can draw 
'with benefit' on the operations in natural learning situations of which 
learning language use is an instance. 



Conceptual Background 

Brice-Heath (1983) identifies the relevance of the learners knowledge 
of language-in-use to the process of learning to read. In her research, 
children who were ai:)le to read to lean} did so in totally integrated 
contexts: the message and the situation of contact with print were 
dependent on each other. In this way patterns of language use were 
learnt through demonstration: and tiie elements of these situations lacked 
independence. Elements comprising language systems, me.ssage(s). 
situational context, purpose or intent, pairs of participants, processes of 
participation — negotiation and constniction, medium of delivery, as 
well as non-language replications of the niessage(s) exist in dynamic 
interaction. She found the interaction to be such that when an element 
was isolated it was viewed as 'unknown' and therefore lacking signifi- 
cance. However, focusing attention in a way that retained the relatedncss 
ot language-in-use, became another demonstration of language-in-use. 
This simultaneously restored the isolated element to the status of 'known* 
and added new knowledge of that element. 1 his offered the learner a 
perspective on the nature of literacy as well as on how one manages 
the process of learning to be literate. 

Frank Smith (198.^) argued for demonstrations, engagement and 
sensitivity as components of the approach to language learning in sc hool. 
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This approach makes use of the learner s familiarity with the components 
in natural language learning. 

This argument supports the demand for continuity between formal 
instRiction on the one hand and the learner s language knowledge and 
experience with systems of communication and ways of using language 
on the other. Clay, writing in Reading Today comments on the value 
which meeting this demand has for achieving independence in learning 
and economy in teaching: 

They [learners] have abilities to initiate, construct, and actively 
consolidate their own learning. If we enlist these learning ini- 
tiatives, then we do not ha\'e to teach so much . . . (Clay, 1993. 
p. 3) 

tjilisting their learning initiatives requires some understanding of 
what the learner brings in respect of those elements comprising the 
natural learning situation. Enlisting their initiatives means providing for 
the needed continuities with, and support for, what the learner already 
knows about appealing to the totality of his or her prior knowledge and 
about allowing the context of situation to signal the rele\'ance of what 
he or she knows for the task at hand. Enli.siing their initiatives means 
aUowing for the learner's ownership of the goal or goals to be achie\'ed 
by the application of what he or she knows. 

But enlisting ilieir initiatives must simultaneously mean providing 
opportunities for the learner to have access to the facilitator s systems, 
thus providing the benefits of the experiences of others. This further 
demand is for a richness in the context of situation. This richness will 
allow the demonstration and exploration of the known signs and of the 
nature of connectedness at the .same lime that it provides for the seeking 
out of new connectedness, signals and perspecli\'es. 

From this vantage point, the issue in formal instruction toward 
lileracN' becomes that of identifying what it is thai the learner brings and 
what the demands are of nuikiug sense — of percei\ing a relevant and 
coherent whole in the contrived learning situation which characterizes 
school. 



(Jnestions and Issues: An lixploratfon 

II seems useful to focus on these issues and demands as they are 
manife.sled in the practices and assumptions of literacy de\ elopmeni in 
classrooms. A relevant (juestion then becomes. What is demonstrated 
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through the practices of classroom instaiction? The ensuing question 
then is: What assumption underlies the choice of demonstration? A final 
question relates to the engagement of the learner in these demonstra- 
tions and assumptions. 

The questions, concerns, and issues in this chapter are raised and 
explored with reference to a context broadly described linguistically as 
Creole. This, in general terms, labels a situation in which the language 
commonly in use is derived from languages in contact. The specific 
Creole context of this presentation is one with an English-related Creole 
in a situation where a standard form of English is the official language. 
Literacy is taught with respect to this official language. 

The English-related Creole in focus was bom in a socio-political 
power structure associated with slaver>\ While African slaves out- 
numbered masters, linguistically, slaves were not a single unified group. 
The perfection of the model of English offered by the motley crowd of 
masters Lmd masters' representatives may have been equally discon- 
tinuous, Commimicative needs and intents had therefore to fabricate 
bridges and blaze trails thus giving rise to new and different language 
forms with the resulting language essentially an oral form. 

While currently, the linguistic situation may be described as a 
continuum with the Creole at one end and standard English at the 
other, and with sections of the population having access to the full 
scale, many haw not mastered standard English, This has been the 
source of dis,satisfaction with secondar\^ school-leaving achievement 
assessed through examinations expected to be written in standard 
English, The (iual't\' of performance in English language on these ex- 
aminations is largely interpreted in terms of literacy levels. Some claim 
that learners enter the secondary- phase of the system with low levels 
of literacy and suggest a link between this and the (juality of perform- 
ance and achie\ ement in the official language. Some examples of dif- 
ferences between the two language forms are presented in Table 3.1 
while some issues related to the medium of presentation are suggested 
by the material in Table 3 2. 

The exploration of literac\' learning in this context was structured 
to prov ide a description of classroom practices and their assumptions 
with respect to literacy development. This is seen as representing one 
perspective on conditions and processes of literacy acc|uisition and 
providing a launching pad for negotiated .staff development as a 
background against which to elaborate on the initial description of 
practices and assumptions, 'Hie choice of a language focus rellects the 
view of learning to be literate as learning the operations of the written 
form of langu;ige-in-use. 

•/O 
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Table 3.1: Examples of Jamaican Creole — standard English differences 



Jamaican Creole Standard English 



Phonology 


bud 


bird 




jinks 


drinks 




jugs 


drugs 




chiljen 


children 




chuck 


truck 




chavel 


travel 


Semantics 


ignorant 


irrascibie 


Syntax 


a beat dem beat im 


he was beaten/they beat him 


a im dcrn beat 


they beat him 




a dem beat im 


they beat him 


Table 3.2: Attention to word boundaries 


Jamaican English 




Standard English 


one suppond a time 




once upon a time 


mixty motions 




mixed emotions 


a tired 




attired 


shed rule 




schedule 


to wide in 




to widen 


earth wake 




earthquake 



Source: Scripts from a regional examination taken at end of secondary school. 
Note: Candidates are all from a linguistic context in which English is the official language 
while the majority of the population speak a Creole form and interact in a culture which is 
dominantly oral. 



Method 

A central and necessary' concern of any investigation of literacy in the 
context outlined above is to find instmctional paths to the success by 
which the achievement of schools is judged, which some need for 
social mobility and betterment, and about which many seem to care 
very little if at all they care. In general terms, research identifying stand- 
ards of performance in relation to socio-economic status or to ex- 
penditure in education have not offered much direction for improvement 
and for instruction. There is a sense of the inadequacies of the usual 
form of research cjuestions and the paradigms used to investigate them. 
The concern with norms or with predefined categories continues to beg 
cjuestions and to sideline those learners who do not satisfy the norms 
or fit the categories. It seems that these learners deser\'e to be led to 
achieve whether or not they fit the categories. Research findings based 
on a control experimental-group paradigm point to a common curric- 
ulum as the road to achievement for all and raise the issue of the 
validity of assuming universals. 
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Against this background, the decision was taken to collect data 
with the capacity to describe the extent of the continuity (with respect 
to hteracy) between learner knowledge and school practices and \\'ith 
an openness to seeing actual patterns rather than instances of a precon- 
ceived pattern or set of patterns. The part of the project reported here 
used school interviews and obser\'ation schedules and a series of per- 
formance tasks. 

The schedules for school focused on the teachers' classroom prac- 
tices as they relate to literacy: 

• the nature of engagement with language; 

• the roles of participants and patterns of goal selection; 

• the structure of situations/events in which learners are engaged; 
and 

• the concept of literacy and literacy learning projected. 

The data were collected in the context of a model of staff develop- 
ment in which teacher participants individually select an aspect for 
development. A series of video tapes formed the basis of discussion of 
classroom practices with particular reference to teaching literacy devel- 
opment. These became the stimulus for selecting an aspect. Prior to 
this, teachers had pro\*ided a written description of their own practices 
for which they were encouraged to provide the rationale. 

The teacher-participants were from two schools and were teachers 
of initial grades in the primar>- system (age group: 6-11+ years). The 
schools are in two distinctly different geographical locations: one deep 
rural and agricultural and the other approximately 120km away on the 
outskirts of a flourishing tourist resort and dependent on the resort for 
employment but self-sufficient with respect to social interaction and 
enteitaiiiment without being a closed commimity. The deep-rural school 
contributed three teachers while the other contributed two teachers. 

From the performance tasks which learners were asked to do, 
information was sought on their access to language production: 

• verbal response to picture stimulus; 

• non-verbal response to picture stimulus; 

• concept of use of language in writing; and 

• application of knowledge of sound values attached to letter 
symbols used in writing and reading. 

The picture stimuli allowed for labelling of objects, labelling of actions 
and explaining as verbal responses. In the other picture-stimulus task. 
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learners were asked to draw something similar in function to the picture 
the identity of which they had satisfactorily established and the func- 
tion of which they had accurately described (somewhat equivalent to 
the task of identifying a word with a targeted initial sound). Another of 
these tasks required the selection and reproduction of a solid shape 
needed to fix a cup which was not holding juice because it had no 
bottom. 

To explore their concept of writing the children were encouraged 
to write their own stories which they could read to me. This task af- 
forded the opportunity to check on their knowledge and use of the 
sound values of letter symbols in writing and in reading. 

Results 

From these data one arrives at descriptions and interpretations rather 
than results. From the gathered data, a recurring element of the practices 
aimed at literacy development is the use of picture reading in which the 
focus is on details of the picture or illustration. Care is usually given to 
seeing that the learners recognize these details as familiar. Learners 
generally provide verbal labels for objects in the picture or illustration. 
Pictures or illustrations normally carry a single-sentence caption learnt 
by repetition which is viewed as essential to learning. Attention to word 
forms is given in a different context and without reference to the words 
comprising the caption. Instruction targeting word recognition brings 
into focus words connected by their structure and with the unwritten 
requirement that they be monosyllabic. The stated objective is to teach 
the initial sounds and their representation. The task is set in the context 
of the teacher listing the target letters (e.g., ni k f\ s) and providing 
words with missing letters. The learners are to supply from the list of 
letters that which is missing from the words provided. 

Underlying this structure of instruction is the view that letter-sound 
knowledge is a recjuirement for reading to begin. While much attention 
is given to the phonological system of language, the elements are 
detached from the meaning functions they routinely serve. Thus the 
practices seek to engage learners with language forms in the written 
medium. Attention to the communicative intent of language is delayed 
until mastery of sound-value representation is assured. Performance on 
the various tasks suggests that learners, if they have stored this rep- 
resentational information, eilher do not have access to it or do not 
know its purpose (see Sample j. Figure f>A). Oi course, these options 
need not be mutually exclusive. 



Figure 3. 1: Sample J: Silbert 
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f3c)th the response to print and the response to the demand for 
print occurred as reproductions from memory without any apparent 
concern for accuracy (see vSample X. Figure 3.2). For some, the demand 
for print ex'oked a search for ax ailable copies of writing even when the 
learners were imable to read what was copied (see Sample Y. Figure 
3.3). The writing was apparently not recognized as real language and 
the forms not seen as functional or to have any connection with a 
nie.ssage (see vSample K. Figure 3. O. 

The language of reference in the instructional practices was stand- 
ard Knglish which is not the first language of any of the learners. This 
means that learners have no language point of reference for the language 
system with which instruction is working. A dilemma exists here for 
learners facing form in isolation from function and without a point of 
reference. Perhaps the nature of the learners problem is accessible if 
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Figure 3.2: Sample X: Larry 

^■^ii ^^y^ QAi {OOKfof 

loOl ^-t)>r^ ^ l^ek Tor ft 



it IS noted that the phonological system of the learner's language has 
the capacity for producing the following as /7o?»cphones: 

thought ( / taught/ ) — taught 
drugs ( /jugs/ ) — jugs 
truck ( / chuck/ ) — chuck 
trunk ^ chunk. ) — chunk 
hot ( hat/ ) — hat 

It is also true that the first language does not use a system of marking 
number in action concept words. Hence the form distinction between 
has and hare is not salient so the rele\ ance of the difference often 
escapes e\en older learners. 

Smith (op. cit.) observes that demonstiations, engagement and 
sensitivity are cmcial to learning. This, in a sense, describes the ecology 
of sense-making which is what is central to learning. The distance 
between the learner s language knowledge and the language knowledge 
recjuired by the emphasis of literacy instruction makes demonstrations 
and engagement impossible and introduces the expectation of difficulty 
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Figure 3.3: Sample Y: Gary 



LI c^B. 





Figure 3.4: Sample K: Latoya 



Pat aui oarnplay J},,, J^,^^ 




BEST COPY AVAILABLE 
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and experience of failure. The absence of such expectation and experi- 
ence is what Smith labels \sensitivity\ Harsie, Woodward and Burke 
(1984) observed demonstrations, engagement and sensitivity as a re- 
curring underlay in successful oral-language learning and in literacy 
learning in learners in a range of ages. 

The code emphasis which characterizes the practices in these 
situations has been argued by some theorists to be valuable in the 
beginning stages of reading (Chall, 1983; Perfetti, 1985), As a universal 
principle, it seems questionable in circumstances such as those of the 
learners described here. The environment defined by the learning con- 
text will need to be a modifying input in applying this as a principle. 
U\ like the school, the home-belief system includes an acceptance of 
the principle then the learner's dilemma can be intensified. 

It was observed that where the teacher's practices reflected a 
modification of this belief, the learner did not face the dilemma to the 
same extent. However, many of these learners appeared o be having 
difficulty overcoming the exposure to the more prevalent practices. The 
learners in that context were making reasoned demand for letter-soimd 
help. 

In addition to tlie abo\ e, classroom practices supported the role of 
the learner as passive and teachers as the sole judges of the quality of 
wliat is achie\'ed. Learners* responses were accepted in relation to their 
freedom from 'mistakes*. Mistakes were obser\ed to W reasons for 
condemnation ratlier than growth points. An emphasis on the code or 
the material can only result in teacher-selected goals in the circumstances 
of tliese learners: and access to standards of achie\'ement is exclusive 
to the teacher. Garner (1990) argues that these conditions can be ex- 
pected to be deterrents to strategy use. Hence, e\ en if learners had 
ways of learning, the conditions defined by the classroom practices and 
a.ssumptions do not invite or support their application. 

On llie perfonuance tasks the learners were o'oser\ed to be copy- 
ing from a model e\'en wlien tliey were to he writing their own stories*. 
Tliey apparently did not expect tlieir efforts to be acceptable if they did 
not conform to that wliich they had come to identify as the teacher's 
standards. In practice, all their writing was copying from the chalkboard 
written language generated by the teacher. The dilemma was that the 
children could accept no responsibility for writing a.s composing (see 
Sample \\ I'igiire 3."^). 

As can be deduced from the foregoing, the structure of the situation.s 
in wliich the learners were engaged could not be described as com- 
municative. The component o\ jniiposc \\'\\\\ reference to both the im- 
mediate context and the u.ses of language wa.s missing (Ilalliday. l^)"'.'^. 
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Figure 3.5: Sample F: O'Brien 
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O'Brien 




S5ia 




Gee hny name ^cre.. 
My name-fa^ 



Siniilu op. cil.). There was, in a way llie ec|ui\ alent of* gathering blocks 
hul w ith nothing to build and no reason to build. 

What is the concept of literacy and literacy learning being devel- 
oped by these learners from these circumstances? These learners were 
not ()bser\ ed to have any interest in books and were without the texts 
provided by the system because they had destroyed the books. \*'..ry 
few di.splayed any interest in available writing; and ewn these few did 
not show su.stained interest. Literacy acli\'ities were, by and large, .school 
activities. Where practices were modified away from being materials- 
centied and towards being learner-centred, the learners showed an 
interest in reading each others' stories* and expected each to be .saying 
.something different. They e\en expected to be able to laugh when they 
read. While they were not attempting the u.se of writing for personal 
functions, they were more willing to do writing as composing on 
(Icniand isijij^ Sample 1). I'igure .^.0). 
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Figure 3.6: Sample D: Jermaine 




It is not clear whether these learners are yet aware of how they can 
use much of what they have in order to learn to be literate. Learning 
to be literate is more something someone — the teacher — does to you 
than being something the reader (learner) is doing. The notion that 
treating small elements makes learning easier may need to be examined 
against the realities of natural learning situations, Hartlett (1932) cited 
by Ivan-Nejad, McKeachie and Berliner ( 1990) provided the background 
to the development of thought about the multisource nature of ie.'rn- 
ing. This thinking identifies the problems imposed by the principle- of 
simplification by isolatic^n exemplified in the letter-sound code emphasis 
characterizing the practices obserwd, Embedding the concerns of this 
emphasis, according to the multisource tlieoiy, is to have the learning 
benefit from the multiple input of integration, (see Figure 3 "'). The 
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Figure 3. 7: Models of routes to literacy 



1. 



School 
Instruction 
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input from many sources simplifies the issues. Against this t')ackground, 
Resnick's (1987) distinctions between learning in school and out could 
explicate the learner's dilemma in aligning his ways of learning with the 
learning demands of school. 

Messages for Teachers 

As currently conceived, literacy is language-based and focuses on the 
written form. Therefore, in assisting literacy acquisition it seems important 
to recognize the status of the learner's knowledge of language. Perhaps 
the question to be answered is 'What does the learner know about 
language?' This may be limited to a description in terms of form and 
function. Knowing about language is not intended to refer to whether 
the learner can talk about language. Rather it refers to his or her ability 
to operate all the controls needed in the circumstances in which lan- 
guage is being used. In other words, how language is being used, for 
what purposes, and in what form, are important concerns. 

In describing the status of that knowledge, it is necessar\' to establish 
the relationship between the language of literacy and the learner's 
language or languages. The learner's first or spoken language may not 
always be the language in which literacy is being acquired. Also, the 
learner may not have accjuired literacy in his first language. Remember, 
too that the language known will be the only point of reference in the 
approach to written language; at the same time the learner n^ay not 
know this or see the connection. It is from the points of connection that 
a learner's accjuisition of literacy can be directed. 

Some systems of langiiage are not automatically and independently 
accessible even when the individual is a competent u.ser of the language. 
The phonological system can be expected to be particularly trout')le- 
some; differences are known to exist even across groups in the same 
speech commimity. Wheie the writing s\\stem engages arbitrary- symbols, 
the phonological system can be a source of confusion. (Consider the 
representations, hope. rape, lope, sope (soap): reach, react.) Attending 
to the representation of this system in isolation introduces a Ixirden. 
Tlie burden can be eased by attending to the system in the context of 
the purpose it selves, thus giving insight into letter functions. 

Learners know something about learning: that the unknown must 
make a sense that solves the prot')lem at hand... that provides the 
answer(s) needed for the next step or to complete the task. That is, the 
unknown lakes its meaning from what is known. The learner needs to 
be al'>Ie to retain this principle intact, lie or she may need help to 



7he Ecology' of Sense-making 

identify what he or she knows and how to use that knowledge. This 
will provide support for existing knowledge and anchor the new learn- 
ing. What is known about language and its written form will need to 
be activated and utilized. Environmental print (which includes labels 
and packaging) can be a useful starting point. 



Conclusion 

As teachers we would do well to operate as learners in the teaching- 
learning situation as much'as the learner in our charge. The idea is not 
new (Holmes. 19S2). It is professionally challenging. Above alK it is 
eminently worthwhile. In that frame we can be part of the dynamic and 
expanding context which defines learning situations. 
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Summary 

This chapter documents the ex ident success of a project mounted 
in a cluster of inner-city schools in Brighton, England in getting 
less able readers to enjoy reading more and to get better at it 
through effectiw group work. A large proportion of the chil- 
dren involved belonged to ethnic minorities. Results from the 
study suggest that pupils do not learn to produce or interpret 
language effecti\'ely simply by listening to a teacher talking. In 
the teaching of reading, collaboratiw work among pupils is 
essential. 



Introduction 

Hiis .^tudy documents the e\ ident .success of a project mounted in a 
cluster of inner-city schools in Bristol in getting less able readers to 
enjoy reading more and to get better at it (Cionnan ct cii. 1W,S). A large 
proportion of the children in\'olved belonged to ethnic minorities. 

In 1992, Avon LFA obtained funding from the Dfl^ to mount a 
programme in Whitefield Fishponds School and its main feeder primary 
schools to proN'ide support for language-related acti\'ities, particularly 
reading, using an approach that helped pupils to work effectively in 
groups. The local authority invited the National Foundation for Kdu- 
calional Research (NFHR) to advise on methods of evaluating the 
programme. Cionnan, the former head of the Centre for Research in 
Language and Communication at NFFR, was asked to cariy out the 
e\'aluation. In this he had the assistance of John Trimble, an experi- 
enced teacher seconded to the project by A\'on LF'A. 
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The Evaluation 

At the end of the Summer term in 1992, the general reading attainment 
of the pupils in Years 6 and 8 in the schools was assessed through an 
appropriate form of the Reading Ability Senes. a nationally standard- 
ized test (Kispal. Gorman and Whetton, 1989). The mean score on the 
lest of the primary- pupils w as 100.6. The information from the initial 
lest was used by the teachers at both age le\*els to assemble pupils into 
mixed-ability groups. The groups of four to fu'e children each included 
at least one le.ss able reader and one good reader. Sixty groups of 
children were assessed at primary' le\'el and forty-nine at secondary*. 

In the following two terms, each pupil completed two additional 
purpose-designed tests. The acti\'ities im'oh'ed focused on the inter- 
pretation of a poem, in one case, and a set of thematically related in- 
formative materials, including newspaper repons, in the other. The 
collaboration of the teachers invoh'ed and particularly of jane Davies. 
llie head of the Communications Facult)* in the secondarv' school, was 
crucial in devising tests that related to schemes of work in use in the 
schools. 

The Collaborative Reading' tests devised had three main compon- 
ents. The first in\*ol\*ed independent reading. The pupils read the poem 
or reference materials indi\ idually and answered the cjuestions linked 
to these. The second component involved cr>//r//x;;r///rc reading. At this 
point pupils shared their responses and agreed on a joint response. The 
third component invohed rerieir. The children re\"iewed their indi- 
vidual responses in the light of the group discu.ssion and made changes 
if they wished to. On the basis of the changes made it was possible to 
calculate what we leriued a net improvement score' for each pupil. 

The Results 

As would be expected, pupils' scores on the tasks after cliseussion 
were, on average, significantly higher than those gained through inde- 
pendent reading. More interesting and less predictable, was the finding 
that on each task the net improvement score of the less able readers 
was higher than that of pupils in the other groups. That is to say. these 
pupils made a significantly greater number of accurate amendments 
after discussion than did pupils whose initial scores were relatively 
high, 

Our initial assumption was that this pattern of response was a 
statistical artifact in that the good' readers — the pupils who made 
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Figure 4. 1 : Primary non-fiction — Before and after: 3 groups on prescore 
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Figure 4.2: Primary non-fictton. Gain from experiment 
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Figure 4.3: Secondary poetry — Before and after: 3 groups on prescore 
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Figure 4 4 Secondary poetry Ga'n from expennient 
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fewer errors initially — had less scope to improve. We therefore stand- 
ardized' the scores (by dividing the number of accurate amendments by 
the number of errors made on first reading). The results still showed 
the group of low perfomiers making significantly more improvement 
than the high performers on three of the four tasks. There is no ques- 
tion therefore that in reviewing and revising their responses, the less 
able readers applied what they had learnt through collaborative dis- 
cussion more effectively than the more able readers did. 

Figures 4.1-4,2 above show the pattern of response of primary 
pupils in completing the non-fiction task. Secondary' pupils results in 
completing the poetry task are shown in Figures 4,3 and 4.4 below. 

Interpreting the results 

How can the results be explained? Earlier research has shown that, 
typically, less able readers are less willing than good readers to alter 
their initial interpretations of what they read. Having once arrived at an 
understanding of the meaning of a statement, they tend to adhere to it. 
To give an example of this lack of flexibility, consider the results of an 
Assessment of Performance Unit (API*) lest ilhat Sinking Fcclhig) in 
which pupils were asked to interpret an extract from a story* by Hel.sy 
Byars about a little girl (the narrator) who caused a raft made by her 
older brother and his friends to sink, after they had refused to let her 
play with them (Gorman and Kispal, 1987), When asked, the majority 
of the fifty-two pupils in the k)wesl performance band, and all but one 
boy in that group, said that the person telling the sioiy was a boy, 
despite the linguistic and circumstantial evidence to the contrary, that 
was revealed as the sioiy unfolded. (For example, her brother says Til 
kill her\ and her mother advises '1 .shouldn't push your luck, madam .) 

Why then does collaborative reading encourage such pupils to be 
more open-minded and willing to alter initial impre.ssions? There are 
three main reasons. Firstly, focused discussion helps pupils to under- 
stand both the meaning of what they have read and of the c|uesiions 
asked about it. it helps to clarify what is expected of them. Secondly, 
discussion with a small group of other children gives these pupils scope 
to dex elop and synthesize their own initial ideas about what they have 
read. In doing this, they gain confidence. I'inally, the opportunity to 
fonnulale a written response collaboratively, removes their concern 
that they may make further errors in giving their response in writing. 
These are certainly some of the reasons why the less able readers in 
this study were galvanized by the process of collaborative reading to 
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amend their initial interpretations of what they had read. The pupils' 
own comments show that, in the main, they enjoyed the experience: 
the teachers' obser\'ations suggest that the pupils became more adept 
at the process of collaboratis'e reading as the terms proceeded. These 
are obvious benefits. 

When an alternative form of the Reading Ability Soics was com- 
pleted by the primary pupils in March 1993 — after a period of seven 
months, including a Summer break — the results showed a significant 
improvement in the general reading performance of the group with a 
gain in mean score of approximately 2,0 standardized points. This may 
not be wholly attributable to the collaborative reading programme, but 
the programme would certainly ha\'e contributed to the change. 

(h'onp Work in the Icachifig of Reading 

Effective group work in classrooms is far less common than is genera.lly 
supposed. The NFER sur\'ey of 7he Teachi?ig of Initial Literacy (Cato 
et ai. 1992) showed that w hile the children in the classes ()bser\'ed 
were generally seated in groups, they seldom worked collaboratively 
within these groups: nor was it common for them to engage coopera- 
tively in a task that required each pupil to provide a spoken or written 
response. 

HMI reports and other studies have pointed out that although 
children sit in groups in the majority of infant and primar\- classrooms 
there is usually no specific demand for them to work together on a 
group task or undertaking (Ti/ard et aL, 1988). Typically*, Ne\'ille Bennett 
has ()bser\'ed, pupils work /;/ groups but not as groups' (I5ennett, 
1992). fie added. The unfortunate outcome of that is a high le\el of 
low quality talk and a dearth of cooperative endeavour.' 

In reviewing research on second-language accjuisition, Peter 
Dickson noted that the increase in collaborative work in whicli learners 
work in groups or pairs has been a central feature of developments in 
classroom practice (Dickson, 1992). He added. Although there are strong 
educational reasons for group work it is also a necessarx' condition for 
creating more open interaction of the kind judged important for \2 
development.' It has been established that group work leads to more 
talk. Moreover, compared w»th teacher-fronted work, peer-groups, 
particularly pair.v appear to offer the greatest opporlunity for both re- 
ceix ing comprehensible input and producing comprehensible cnitput. 
Peer-group work also provides the best opportunities for feedback. It 
is obvious that preciseh the same considerations apply in teaching 
English to English speakers. 
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The Main Finduig 

The most important finding from this study relates to the significant 
extent to which less able readers were galvanized by the collaborative 
reading process to ainend or correct their initial interpretations of what 
they read. They clearly benefit from the activity. Given the results, how- 
ever, it would be sensible for teachers to devise group situations in which 
more able readers also have opportunities to read together collaborat- 
ively on specific occasions. 

Messages for Teachers 

The inain messages for teachers that the collaborative reading project 
points to are these: 

• Pupils need to be given frequent opportunities to work col- 
laboratively on tasks in which the overall purpose or outcome. 

' their individual responsibilities, and the form of response (spo- 
ken or written) required, are clear to them. 

• The teachers' main jobs will be to select appropriate, interest- 
ing texts for reading; devise questions about the texts that focus 
on the main issues; and provide guidance about how pupils 
should respond to tht text. In many cases this response will be 
spoken and directed to the class as a whole. 

• Needless to say, the teacher will observe the activities of the 
groups and intervene when necessar>', to facilitate the working 
of the groups. 

• Normally, this obser\'ation of the prc^cess and outcomes of col- 
laborative reading provides the basis for a teacher-led review 
of those issues that ga\'e rise to differences of interpretation 
within the groups. 

Conclusion 

Sustained, individualized work is recjuired in all areas of the curriculum 
but using language is essentially a cooperative activity. Speaking, reading 
and writing are best practised in cooperation and collaboration with 
other pupils and adults. This study demonstrates clearly that collaborat- 
ive reading is a form of group work that is of particular benefit to less 
able readers, and one that pupils in general enjoy and from which they 
benefit. 
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Summary 

Factors affecting literacy standards range from institutional 
through methodological to systemic. This chapter will consider 
the effect the Fnglish writing system has on literacy acc|uisition. 
vStarting from the alphabetic principle, it will compare such 
systems as Italian, Latin, Japanese, American and British. The 
diverse facets of irregularity in Fnglish will be related to the 
learning process, the initial teaching alphabet, and the re\*ised 
English Order of the National Curriculum. Finally, the issue will 
be placed in its hi.storical and global context, and (juestions asked 
about educational priorities and the potential for reducing sys- 
temic obstacles to impro\*ed literacy standards in the future. 



Note 

To demonstrate certain aspects of its theme, this chapter is written in 
Cut Spelling (CS) (L'pward. 1992a): some of its implications are consid- 
ered in the final section. CS is a simplified, partially regularized ortho- 
graphy for Fnglish which remox'es three categories of misleading letters: 

1. Letters irrelevant to pronunciation (e.g.. debt becomes CS dct). 

2. Letters representing post-accentual schwa with i.. m. n. k (e.g.. 
hottk\ bottom, button. become hotti hottnt. buttn. buttr)\ 
similarly in inflections and some suffixes (e.g.. iraitd. iraitti^. 
Jishs. catbl. edbl). 

^. Most doubled consonants are written single (e.g.. botti bottni. 
huttfi. butt}\ acamiwodatiou become hotl. bottn. butiK buti\ 
(lamiodattoH). 
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Other simplifications reduce the use of capital letters and apostrophes. 
Additionally, three rules of letter-substitution apply: 

1. The sound /f/ is spell f (e.g., fotograf, ennf). 

2. The vsound of j is spelt j (e.g., Jymnasi jinjr, juj). 

3. 10 pronounced as long v is spell v (e.g., high, height, sign 
become h\\ hyt. syn). 

Readers unfamiliar wilh CS are recommended to ignore unusual spell- 
ings at first and read as fluently as possible. With a little praclice, reading 
CS becomes perfeclly easy. 

This note is now repeated in CS to illustrate the system. 



Note (Cut Spelng version) 

To demnstrale certn aspecls of ils iheme, ihis chaplr is ritn in Cut 
Spelng (CS); som of its implications ar considrd in th final section. CS 
is a simplifyd, partialy regulrized orthografy for english wich removes 
thre calegris of misleadng letrs: 

1. Letrs irelevnt to pronunciation (e.g., debt becoms CS del). 

1. Letrs representng post-accentul shwa with l, m, n, k (e.g., bottle, 
bottom button, binterht^com hottl. hottm. huttn. buttr)\ simlrly 
in inflections and som sufixs (e.g., waitd. waitng. fishs. cathL 
edbl). 

3. Most dubld consnnts ar ritn singl (e.g., bottl. bottm. buttn. 
biittr. (iccommodation becotii botL hotnu hiitn. butr. 
acomodation). 

Olhr simplifications reduce ih use of capitl letrs and apostrofes. 
Aditionly, three rules of letr-substitution aply: 

1. Th sound /f/ is spelt v ic.^.. fotogmf. enuf). 
1. Th sound of i is spelt i (e.g., jymnast. Jinjr. Ji{/). 
3. K. pronounced as long v is spelt v (e.g., high, height, sign 
becom hy. hyt. syn). 

Readrs iinfamilir with CS ar recmendd to ignor unusul spelngs at first 
and read as fluently as posbl. With a litl practis, readng C^S becoms 
perfectly esy. 
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Introduction: Systms and Standrds 

Th aim of this chaptr is to encuraj al litracy workrs, wetlu reserchrs, 
leachrs or lernrs, to take a criiicl vew of th riting systms in wich htracy 
skils ar aquired, especialy in enghsh. Th centrl question askd is: how 
far ar th dificltis faced by lernrs merely a product of particulr featurs of 
th riting systm concernd, rathr than inherent in th lernng process? This 
critici vew reflects both th intrnationl perspectiv and th emfasis on 
fonics as ih sycok)jicl basis of alfabetic Htracy, wich caractrize th presnl 
work as a hole. Such a vew must be expectd to gro in importnce, as 
ih hyest standrds of litracy ar recognized to be vital for th prosperity 
and welbeing of al cuniris, as s)x*olojicly welfoundd litracy policis ar 
developd for educaiionl curicula. and as th needs of foren lernrs and 
users of english com to demand incresing atention. 

Disatisfaction with th conditions of litracy aciuisilion in english is 
nothing new, indeed it has been a continuing theme in english educaik)n 
for over iOO years (Michael, 1987). Relativly recent is ih awareness of 
ih litracy needs of al children rathr than just of a privilejd class, as is 
awareness of th role of education) standrds in deteminng a cuniris 
ability to 'succeed* in a compeliliv intrnationl environmnt. This lair 
considration especial)* has led to sevrl comparativ studis of educationl 
standrds in difrnt cuntris, focusng in varying degrees on th specific 
aspect of litracy (CKRI, 1992). Formost amongst these is th study by th 
International Association for the Evaluatk)n of Educational Achievement 
(IKA, 1992a, 1992b, 1993) across thirty-two cuniris. It is th subject of 
Chaptr 12 of this volume. Comparisns between litrac)* standrds in difrnt 
cuntris ar dificlt to make because conditions var>' so much, but th IHA 
stati.stics may alow som tentativ conclusions about th efecl of difrnt 
riling systms, in particulr about th extent to wich mor dificlt riling systms 
corelaie with loer standrds of litracy. Initial obsr\-ations from th sur\w 
wer that Finland, with an exeptk)nly regulr riting systm, acheved th 
hyest litracy standrds, wile Denmark, with a less regulr riling syslm, 
fared less wel (Allerup, 1993). Cerinly th hypothesis that iregulr riting 
systms ar hardr to lern, and that systms that ar hardr to lern wil ineviibly 
produce loer standrds of litracy, is liyly plausbl in lojic, and if a causl 
relaik)nship can be demnstrated. llier wil be importnt implications for 
litracy policy in english. 



Method: Clarifyng Concepts 

This chaptr begins and ends with english, and it is therfor first necesry 
to take issu with and hopefuly help clarify a numbr of notions widely 
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Wnting Systems in Different Languages 
Table 5. 1: Thre senses of th word 'speing' 



1 Uses of word 



2 Meanng in context 



3 Wat IS speing? 



'english speing' 
'practisng speing' 
'atrocius speing' 



Conventions for riting words 
Lernng how to rite words 
Floutng conventions 



= SYSTM 
= ACTIVITY 
= ACURACY 



held in th english speakng world about litracy in th english riting systm, 
wich efectivly means its speing. 

Th word 'speing' itself is subject to confusion, ofn being equaled 
with Tiling'. Bui insofar as it is th medium for litracy jenrly, speing 
concerns readng as much as riting. labie 5. / anlyzs som importni senses 
of speing', colum 1 shoing varius uses of th word, colum 2 parafrasing 
ther meanngs, and colum 3 defining them in terms of elementry con- 
cepts. We shal in this chapir be concernd chiefly with speing as a systm 
wich has two functions: to enable riters to comit words to visbl form 
(brail and morse ar exeptions); and to enable readrs to recognize words 
in text. But a premis for both these functions is that th speing systm 
must be lernt. Sycolojidy, th two functions constitute a unity, utilizing 
th same systm and oprating in th same minds, and wen one function 
is practisd. it reinforces comand of th othr. This unit)' is too ofn overlookd 
in english and th functions themselws even confused. Thus th 1993 
revised proposals for english in th english/welsh National Curriculum 
(Department for Education, 1993) seprate readng and riting as Atainmnt 
Target 2 and Atainmnt Target 3, but classify th task of chekng spelngs 
in a dictionry undr readng. not riting. That riting may be an efectiv way 
of inculcating syt vocablry for readng is typicly neglectd. We shal later 
note th success of italian children in lernng to read initialy by riting — 
an aproach wich has ocasionly also been found useful in english 
(Martin and Friedherg. 19H6). 

Ciivn this uncertnty about th concept itself, it is not surprising that 
recent perceptions of english speing hav difrd markdly too. Befor th 
mid-twentieth centurv*, linguists, including som ho had studid th forms 
and historid evlution of english speing in depth, mostly recognized that 
english speing was hyly ireguir, even caotic' Lately, howevr, many 
linguists hav been reluctnl to accept that english speing cud be seriusly 
defectiv, and th fact of its evlution was sointinies taken to imply it 
must be idealy adaptd to modrn needs (sur\'ival of th fust?) (Chomsky 
and Halle, 19(>H). In prefrnce to Xaotic', a range of epithets such 
as 'polysystemic' (Albrow, 19*72), Mogografic' (Sampson, 198S), 
*morfofonemic' (Slubbs, 1980), or 'deep' (Aaron, 1989) hav been aplyd 
in an atemt to sujest som ordr in th systm, Po.sbly th pendulum is now 
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beginng to swing bak — a very recent work at least professes neutrality 
on th question (Carney. 1994). Wethr ordr or disordr is in fact doninnt, 
may be jujd by evidnce presentd later in this chaptr. 

Sycolojists hav been inor concernd with how lernrs internlize english 
spelng patrns. Evidnce has been garnrd particulrly from studis of th 
abnorml — th dificltis of dyslexics, th spelng of nonwords — and varius 
theoreticl inodls hav been proposed. For instnce, th 'dual process modi' 
sujests that readrs treat regulr and iregulr spelngs difrntly, regulr spelngs 
being decoded 'orthograficly' (i.e., by interpretng th sound valus of th 
letrs), wile iregulr spelngs ar recognized 'lexicly* (i.e., hole words ar 
machd against tlier imajs as stord in th readrs word-memr\', or iexicn') 
(Henderson, 19H4). Anothr modi proposes stajes in lernng to read, th 
beginr recognizing words as holes (a iogografic' staje), th oldr readr 
decoding letrs into sounds (an 'alfabetic' staje), and th skild readr dealng 
with al aspects, regulr and iregulr, of th riting systm (an 'orthografic* 
staje rathr difrnt from that proposed by th dual process modi) (Sterling, 



Ther ar soni fairly obvius objections to such modls. English spelngs 
ar not esily divisbl into regulr and iregulr (e.g.. paid is regulr by analojy 
with but iregulr by analojy with played). Yung and adult readrs 
alike read al familir spelngs autinaticly ('lexicograficiy') on syt, but hav 
to decode ('alfabetidy*) al imfamilir spelngs. regardless of regularity. 
Fuithrmor. teachng methods may influence, perhaps even determn, 
how lernng proceeds: a fonic method or a structurd readng sceme may 
produce difrnt stajes or processes from a hole-word method or real 
books . And behind al these objections is th (|ue.stion wethr a coherent 
description of lernng processes is posbl in english at al, wen th most 
varid iregularitis lurk as potential traps for th lernr at evr\' turn. To 
parafrase a sixteenth centur\' obserx'r of th problni: u canot cliscovr how 
children normly lern to walk if u only alow them to go on a tytrope or 
on llier hands (Hart, 1570). 

A mor helpful modi for nornil processes of liiracy aciuisition (i.e.. 
in a reasnbly regulr riting systni) is implyd by Steven Roses findngs 
about th natur of memiy (Rose, 1992). His neurobiolojy tels us that 
sense impressions (e.g., ol th spelng and sound of words) leve fysiologicl, 
chemicl and electricl traces in th brain. If these impressions ar repeatd. 
th traces ar reinforced, and synapses (links between nerv eels) ar 
developd as rules wich facilitate subsec|uent reprocesng of those impres- 
sions. A regulr spelng systm wud thus constntly reinforce fewr syna|)ti( 
links formd during lernng, but a hyly iregulr riting systm like english 
myl send mor complex mesajs lo th brain, les.s stable .synaptic links 
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wud develop, and ih resullng readng skils w ud be that much less relybl 
(compare Magdalene Vernon's 'cognitiv confusion , Vernon, 1957). This 
\synaptic* modi of th sycolojy of litracy aquisilion is also perfectly 
compalbl with th 'evlutionr>'* acoiint of brain clevelopmnt proposed by 
th Nobel prize winr Gerald Edelinan. Evidnce of hyr standrds of litracy 
in reguir riting s>'stms wud furthr confirm its' apropriatness. 

Educationists vews of english spelng ar equaly varid, Flistoridy, 
leachng methods hav ranjed from lernng th ABC, readng thru spelng, 
sylabic analysis, spelng lists, and dictations (Michael, 1987). Fonics is 
advocated today on th grounds that children lern much fastr wen they 
no th letr sounds and can work words out for themselvs' (Lloyd, 1992); 
but this is oposed by a *hole word' aproach on th grounds that th mor 
words u no, th esir it is to recognize and lern othr words, based not on 
fonic corespondnces, but on sylabic and semantic resemblnces' (Smith, 
1991). Th title of Margaret Peters book Spelling: taught or caught/' 
implys a simir dictoniy (Peters, 1985). Nor is th british gox'rninnt consistnt: 
its Kingman Report (DES, 1988) specifyd that 16-year-olds shud \spel 
corectly\ since 'spelng obeys rules', but its Cox Report (DEvS, 1989) 
stated that 'th aim canot be th corect unaidd spelng of any english word 
— ther ar too many . . . that can each out even th best spelf — in othr 
words, english spelng ofn dos not obey rules. Penalizing mispelngs in 
examnations wud thus stigmatize pupils for failng to acheve th imposbl. 

How is it that english spelng. th worlds prime medium of ritn 
comunication. has jenrated such disagreemnt about its natur. about 
litracy aquisition, and about teachng methods? One mesaj of this chaptr 
is that th sheer iregularity of english spelng obstructs not merely lernrs. 
but also linguists, sycolojists and educationists atcmtng to undrstand it. 
Because ther is so lit! sense in tli systm, it is hard to make much sense 
out of it. Th only way to explain english spelng is thru its chekrd histr\\ 
Ther we se letrs insertd in k)an words to reflect ther languaj of orijn 
(e.g.. M in (let), or insertd merely by anak)jy (e.g.. i. in anulhy analojy 
with would), or 'by mistake' (e.g.. v in ptarmigan), or one letr taken 
from french (e.g.. a in resistant) contlictng with a difrnt letr taken from 
latn (I-; in consistent). We se formr alternati\' spelngs sur\'iving as anomlis 
(e.g., speak speech, deign/clisclain) cr>'ng out to Ix^ harmnized. Th work 
of tb sixteenth centur\- scoolinast/s Coote and Mulcaster apart (Mulcaster. 
1582; Coote, 159(t), lernrs and users needs hav scarcely inipinjd on th 
desyn of th systm at al, indeed since ]()()() ther has been no coherent 
overal ^lesyn'. Hence th title givn by th duch linguist Xolst Trenite to 
his 25()dine lymng catlog of orthografic inconsistncis in englisli: The 
r;/v/av (Nolst Trenite, 1929). 
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Results: Consequences of Unpredictbl Spelng 

That ih auihr of The Chaos was not a naliv speakr of english was no 
acxidnt. Wher nativ speakrs, acustmd lo ih sysim from iher erliesl years, 
lend passiviy lo acxepi its iriiaiions and frustraiions, non-naiivs liirai in 
oihr languajs mor ofn react with impatience. Thus ih linguist Mario Pei 
first recals ih ese of lilracy aquisition in italian, befor he moved to 
America at aje 7: 

U ar taut ih alfabet. then u ar givn sequences of spoken and rim 
sylabls . . . Ther ar a few confusing moments wen u ar taut to 
insert an n aftr t.. o. m:, and an i aftr th same consnnts, to sho 
certn sounds befor front or bak vowls. Beyond that, yr ear is a 
gide to yr spelng if u speak standrd italian . . . Th word spel dos 
not exist in th italian vocablr>% wich is a clu to th entire situation. 

lie then recounts (Yule, 1991) his dismay on meetng english, wich he 
has described as 'one of th worlds most awsm messes . 

Tho th polyglot huiigarian Julius Nyikos was adult wen he mi- 
grated to th USA, his reclections ar sim\r (Nyikos, 1987): 

I, as wel as e\*ry olhr hungarian child without serius lernng 
difidtis, lernd to read any and al hungarian texts fluently by th 
end of th first year of elementrv* scool ... I lernd to read latn 
corectly at th aje of ten within one week, and jermn at ele\'n in 
less than a week. So did al th othr hungarian students studying 
in th Gymnasium that I atendd. At forteen I startd lernng to read 
and rite english, and made som progress. At university ... I lernd 
tc) read and rite finish in a few ours, as did al my felo students. 
In 1949 ... as soon as I arived in th United States, I resumed 
lernng to read and rite englisli, and I hav been making modral 
but stedy progress since then. At this rate I hope to hecom 
comfrtbl with english orthografy in anothr decade or so . . . 
(Nyikos, 1987) 

(Se Volume 1, Chaptr S for a contempr\' acount from Hungr\'). 

Wy shud non-nativ speakrs be mor struk by th difictty of english 
spelng than nativ speakrs? An importnt reasn is that non-nativ speakrs 
ha\' an aditionl hurdl to overcom wen they lem english. For nativ speakrs, 
readng jenrly means relating spelngs on th paje to pronunciations alredy 
siord in inemr>'. Non-nativ lernrs, howevr, mor c omnly encountr spelngs 
til pronimciation of wich they canol so esily relreve from memr>\ and 
they ar therfor mor dependnt on spelng to sho pronunciation. That is 
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Writing Systems in Different Uifiguages 
Table 5.2: Spelng-induced mispronunciations by non-nativ speakrs of english 



Pronunciation Modi Mispronounced Word 



nation, sensational 


national 


baked, faked 


naked 


fear, hear, near 


bear 


undermined 


determined 


brother, mother 


bother 


cove, stove, drove 


dove {= bird) 


brown, gown, town 


own 


cowards, Howards 


towards 


woo 


who 



\vy Nolst Trenites n?e Chaos laments th unpredictbl relationship be- 
tween spelng and sound: corect pronunciation of th ritn english word 
is a minefield. Anothr aspect of th problni is seen in a widely used 
bilingul Spanish and english dictionr>' (Smith. 1971): evry english word 
has its pronunciation sepratly spelt in th Intrnationl Fonetic Alfabet 
(itself inconvenient for most users to decyfr), but th pronunciation of 
the Spanish words is transparent from ther norm! spelng. Mispronuncia- 
tion of english words by non-nativ speakrs is undrstandbly rife — Table 
5.2 givs soni exampls of such spelng-induced errs. Not only non-nativ. 
but nativ speakrs too need spelng to sho pronunciation, and th failur 
of english spelng to do so reh'bly is seen to restrict litracy potential in 
varius ways. E\'en hyly litrat nativ speakrs hav th comn experience of 
discovrng as adults that they hav mispronounced certn words since 
childhood. Travlrs ar frequently nonplusd by th spelng of place names 
on maps or syns. Th less wel educated ar inhibitd from public speakng 
by fear of ridicule for mispronouncing. Dificlt spelngs ha\- been shown 
to deter yung peple from expandng ther acti\' vocablrx' (Moseley. 1989). 
Such limitations on litracy ar al con.sequences of iregulr spelng. 

Yet th problm is not suficiently aknolejd. Al too typicl and influen- 
tial was Chomsk\- and Halles remark that orthografy is a systm desynd 
for readrs ho no th languaj* (C:homsk\' and Ifalle. 1968). a statemnt to 
wich we must object in almost e\ r\- particulr. First, we alredy rejectd th 
notion of desyn" in modrn engli.sh spelng. Secnd. th esential systemic 
and sycolojici unity of readng and riting means that orthografy must 
sen- both readrs and riteis: tiicy ha\- an ecjual need for predictbl sound- 
symbl coresi:)ondnce, for decoding and encoding words respecti\iy. 
Third, no orthografy can he desynd only for peple ho no th languaj: 
evrv'one first aproachs languaj as a lernn and we continu lernng thruout 
our li\-es. Most importnt of aK th role of english as a wodd languaj 
demands it shud also be desynd for those ho aproach th languaj as 
non-nativ speakrs. That is in eviyones conumicativ intrest. 
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Interpretation: How Diftclty Afects Utracy 

We hav repeatdly referci to ih dificity of ih english riling systm arising 
from its lak of predicibl sound-syinbl corespondnces, but we hav not 
yet made any ver>' exact comparisns between languajs. A numbr of 
siudis hav compared htracy aquisition across languajs, covrng th roman 
alfiibel and othr alfabets such as cyrihc (both russian and Serbian), 
hel')rew and arabic, as wel as non-alfabetic systms such as th japnese 
sylabris, and Chinese caracirs. j Downings colection Comparative 
Reading ist^ also his contenipraneus paper Is Literacy Acquisition Easier 
in Some Languages than in Others?*) (Downing, 1973a, 1973b) dos not 
make statisticl comparisns, but th gretr dificlty of english (or rathr th 
gretr ese of othr languajs) is repeatdly aluded to in remarks such as Mt. 
seems to be jenrly beleved that children just do not find it dificlt to lern 
to read' in, for instnce, Finland, Jermny and Norway (p. 107). Othr 
siudis, from America. (Tarnopol, L. and M., 1976, 1981; Stevenson et ai. 
198S), particulrly compare litracy standrds in Chinese, japnese and 
english; noteworthy from Britain is Henderson (1984). 

A recent paper by th presnt authr (Upward, 1992c) and especialy 
one by Thorstad (1991) giv mor precise statisticl evidnce. Comparing 
spelng acuracy in english and jermn, th formr found that, in terms of 
mispelngs, english apeard almost sevn times mor dificlt than jerman, 
with redundnt letrs causing most errs. Thorstads study compared litracy 
aquisition by english and italian primary' scool children, and found that 
ill italians, ho had mastrd ther regulr oithografy by systmatic atention 
to fonics, acheved scors sevrl times betr than th english (se Mario Feis 
experience of both languajs described abov). Ther superiorit>' was shown 
b\' a numbr of tests, perhaps th most revealng requiring th children to 
read and rite eit words of adult vocablr\\ Th results ar shown in Table 
5.x Over four times as many misreadngs and nearly eit times as many 
mispelngs ocurd in english as in italian. Obsr\*ations included th foloing: 
Th italian children spent most time in riting in th first year, and readng 
wat they had ritn. Ther wer no graded readng books . . . Ther was no 
remedial teachng/ (p. S29) italian children take one year to acheve in 
readng and spelng wat takes english children thre to five years. Ther 
wer no tecnid limits to watevr ihey myt want \o read or spel. As ihey 
ar confidnl in ther skils, they aproach these tasks mor systiiiaticly and 
without th anxiety jenrated in th english children.' (p. S3S) Th italian 
children used a fonk)jicl stratejy in both h'cadng and riling].' (p. S3()) 
Th numbr of symbls (lelrs) to be lernl in italian and english is 
virluly lli same, and th difrnce in achevemni arises from th much gretr 
lernng load recjuired in english, wher ih use of ih letrs has to be mastrd 
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Table 5.3: Results of Thorstads anglo-italian S-word test 



Word % misred % mispelt 



english 


Italian 


english 


Italian 


english 


italian 


cement 


cemento 


25 


8 


77 


3 


correct 


corretto 


33 


0 


62 


17 


literally 


letteralmente 


45 


7 


99 


17 


perceptible 


percettibile 


61 


17 


99 


19 


permits 


permette 


31 


3 


65 


12 


preparing 


preparano 


24 


4 


79 


6 


special 


speciale 


16 


4 


81 


4 


thermometer 


termometro 


28 


17 


83 


6 



No. of english errs per 4.38 7.68 

italian err 



w ord b\' word for thousnds of words, rathr than sound by sound for a 
few dozn sounds, Th lernng load in soin othr riting systms is even gretr 
than in english. and th limitations on litracy corespondngly even siarkr. 
Wile alfabets ar closed systms, with a snial, fixd numbr of symbls, 
Chinese has an open systm, Ther ar sevrl tens of thousnds of caraclrs 
(a larj dictionr>^ may list around 50,000 — Lindqvist, 1991), but most ar 
outside th reprtoir of th avraj Chinese, for horn 4,000 to 7,000 sufice in 
ordnry readng (Che Kan Leong, 1989). Altho many caractrs hav comn 
elemnls wich asist both lernng and readng, th time, memr>^ load and 
complications involvd in acheving and maintainng even elementry litracy 
in Chinese ar far gretr than required by regulr, closed systms. China, 
howevr, has taken two major steps to facilitate litracy a(}uisition. First, 
initial litracy skils ar taut in pin-yin (= spel sound'), a fonografic riting 
systm based on th roman alfabet, so that children alredy hav th necesr^- 
basic s>'com()tor skils befor they takl traditionl cliinese caractrs, Secnd, 
in ill 19S0s and 1960s as many as 1.7S4 of th most comn caractrs wer 
simplifyd to aid litracy (Yue E Li and l.-pward. 1992). 

Th pin-yin aproach. of givng bcginrs basic litracy skils befor they 
face th ful difidtis of th adult riting systm, has been aplyd in english 
loo. most recently thru th Initial Teachng Alfabet (ita), but using othr 
regulrized spelng s\'slms loo, always with striking success for initial 
litracy aquisition (Tpward, 1992b). Th secnd aproach, of simplifyng th 
adult systm, has not recently Ixa^n aplyd to english — Noah Websters 
spelng simplifications in America in th erly nineteenth centur>' (Clark, 
19()'>) wcr th latest efectiv atcmt (e.g.. regulrizing trarcllcrio travclerhy 
analojy with wamlovr and r/^w/rto r(i;'o;*by analojy with rigorous), 
Iho its benefits hav only sloly perdated across th Atlantic and hav been 
only eratidy adoptd in Canada and Australia, Both these aproachs to 
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facilitating litracy in Chinese ar rele\-nt to problms of liiracy aquisition 
in english today. 

Th valu of an initial teachng alfat')et is also aparent in japnese. Th 
cor of th japnese riting systin consists of Chinese caractrs, but they ar 
used in unpredictbly difrnt ways, and mastrng them is even mor dificlt 
than in Chinese, Despite these complexitis, Japan has long claimd 
outstandng litracy standrds (Makita. 1968), These may not be entirely 
du to long ours of disiplind, concentrated memrizaiion of th Chinese 
caractrs. since japnese also uses som fort^ eit sylabic symbls wich function 
fonograficly. each symbl reguldy representng th same sylabl. This systm 
is so straitforwrd that most children can alredy read and rite it befor 
they start scool. We may again reflect on th significnce of this hed start' 
in th lyt of John Downings findngs on th transfer of litracy skils from 
th i.t.a, to traditionl english orthografy (Downing, 1987). By his inter- 
pretation, th ke to th rapid, succesful aquisition of litracy skils in engli.sh 
is for children first to de\'elop th basic sycomotor skils using a regulr 
fonografic riting systm, and only wen that is mastrd shud they confront 
th aditionl dificltis entaild by a complex adult riting systm such as that 
of english. 

Mesajs for Teachrs: Th Need for Chanje 

One of th ins\ls provided by th i.t.a, experimnts was that litracy aquisition 
in english can be esy, rapid and exiting. Provided spelng is regulr, 
lernng to read and rite givs an enormus siimulaiion to childrens powrs 
of self-expression and educationl morale in jenrl. Italian and many olhr 
languajs send th same mesaj, Th presnt iregulr spelng of english by 
contrast se\'erely discurajs larj numbrs of less able children, and causes 
al lernrs and users a lifetime of unecesr\' trutM. Those ho pronounce on 
educationl standrds. wethr teaclirs or educationists, employrs or politi- 
cians, need to undrstand wy beginrs stumbi in english. wy even ad- 
\ anced students make fre(iuenl spelng mistakes, and how far short of 
its litracy potential th english-speakng worid fals. Students shud not be 
denigrated as .stupid or la/y, nor teachrs as incomjx^tent: // is th ritinj^ 
systm that is uustdthl for Ivnirs. and nat lenity ho ar utisuithl for th 
rilinj» .systm. Mastrng a riting .systm has no \ atu for its own .sake — it 
is striclK* a tool for comunication. If a tool is manifestly unsuitt')l. being 
clumsy and awkwrd in th hands of nio.si users, it is adaptd or replaced. 
Th same is normly tru of riling sy.sims. \\V saw how chinc.se has taken 
steps both to help initial lernrs and to simplify th adult riting systm. 
Simlrly, mo.sl languajs with alfabelic riling sy.sims lia\' from liiiie to time 
modrni/ed ther spelngs too, .so as belr to reflect curenl pronunciation. 

m 
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Table 5.4: Sound-valus of letrs in th classic! roman affabet 

Letr Sound valu in (atn (* = anomlus sound valus) 

Vowls /a, e, o/. but long and short valus not difrentiated*. 
A/E/0 

B Ibl 

C Always /k/. nevr /s/; Cicero = /kikero/ 

D 161 

F /f/ 

G Always /g/. nevr /d^; genius = /genius/ 

H Silent*: honor = /on or/ 

I (also J) Long and short /i/. 

Also* l-glide /j/, pronounced as english consnanti Y. 

(K) Rarely used in latn, but pronounced like C as /k/* 

L /I/ 

M /m/ 

N /rV 

P /p/ 

Q + V ( = U) /kw/ (Q itself pronounced as C and K*) 

R /r/ 

S Always /s/, nevr /z/. 

T /t/ 

V (also U) Long and short /u/: also* W-glide. 

X /ks/ (/k/ as also C, K, Q*) 

Y, 2. Ch. 

Ph. Rh, Th AditionI letrs & digrafs used in latn for translitrating greek 



Thus, in 19 i7 diich siinplifyd ih endng lo lurnn^ tor insincv ih 
old form }wcic)ia}idsch *ducli* into ;/cWc;7^o/fA* (Trouille. 1987); and in 
1959 Spanish aiowd th silcnl initial v lo he dropd from psicologfa lo giv 
sicohgia (Smith. 1971). English by contrast has neglecld thai e.scnlial 
task of modrnizalion for cunliiris. and pays ih incviibl price. 

Il shud be of inlresi to leaehrs to undrsiand how ih en^hsh riling 
syslm has becom so confused. Th alfabel. invenid around 3.700 years 
ago (Hooker. 1990). evolvd thru fenician. greek and elruscn. befor ih 
romans ga\e il most of ih leir shapes we ar famihr with in luodrn 
engUsh. In classic! lain limes, around ih year -100. it had ih lelrs givn 
in Table 5.^1 with sound valus unambiguus exepl for those asiriskd. Th 
classicl roman alfabel was iluis a mostly clear, simpl syslm. orijnh' with 
no digrafs like english ( n. mi. rii etc.. and il has since lent itself fairly 
wel. w illi som adilions sucli as new letrs. digrafs and diacritics (accents), 
to th represnlaiion of many oihr languajs, one of ih erliesi being 
anglosaxn. IIowcnt. it did not distinguish cerln sounds wich modrn 
english dos distinguish (e.g.. long and shoit \*()\\ls). and it inheritd 
ceiln fonlojici distinctions wich english ignors (e.g.. (. k and g for 
variatils of th sound /k ). Latn later lost .som of its orijnl simplicity, wen 
for in.stncc th valu of th letr ( sp\\\ between /s lu'l or 1 . I. > atid til ()ri)nl 
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Table 5.5: How do u spel A/ m english? 



Do u spel /k/ as 
C. CC, CH. CQ. CQU. K. KK. Q or QU 
and wy? 



c 


cat, arctic, arc, arcing 


cc 


soccer, hiccup, impeccable, account, occur, succumb, accursed 


CH 


chaos, school, technology, ache, monarch, stomach 


CK 


rock, rocky, chicken, hillock 


CQ 


acquaint, acquiesce, acquit 


CQU 


lacquer, racquet 


K 


king, weaken, yak 


KK 


trekking 


Q 


queen, quick, squeal 


QU 


quay, conquer, unique, mosque 



/k/ elsewher, so that th first sylabl of Cicero chanjed from /kikV to 
inodrn english /sisV. Th anglosaxns used th roman alfabet to create a 
workmanlike riting systm wich by and larj showd th pronunciation of 
words, indeed it is esir to lern to read old english than modrn english. 
Th invasion of french spelngs aftr 1066 efectivly destroyd that systin, 
and vvat resultd was an unsystmatic hybrid to wich furthr elemnts wer 
unsystmaticly add from othr languajs, abov al greek, in evr incresing 
numbrs especialy from th fifteenth centur>- onwrds. Around th seven- 
teenth century many loose ends wer tidid up, as wen th form hit be- 
came standrd in place of mor complex variants such as hytte: but 
countless iregularitis remaind and soni new ones wer even introduced, 
such as Samuel Johnsons ache for ake Apart from Noah Websters 
abov-mentiond reforms in America, litl has now chanjed for two or thre 
hundred years, with th consequences for litracy alredy described. 

Th extreme disordr of modrn english spelng is wel ilustrated by th 
difrnt posbilitis of spelng th sound /k/. as shown in Tahle 5.5. Som 
defendrs of th presnt spelng systni claim a redeemng featur in its alejd 
inorfofonemic* stability. By this they mean that th root of a word 
maintains consistnt spelng even if its pronunciation chanjes, as wen (to 
quote an exainpl givn by Chomsky) courageous preservs al th letrs of 
til base word courage altho its vowl sounds ar radicly difrnt. Howexr 
7'ahle 5.6 shos how. in man\' insmces, english spelng dos .lot even 
maintain morfofonemic stability. For exampl. a rare instnce of intendd 
simplification, th removal of final k from words like /ro//c*fc in th nine- 
teenth centur\', in fact produced a morfofonemic complication, in that 
it rec|uired k to be insertd befor sufixs, as \n frolicked. Th basic point 
of both Tahle 5. 5 and 5.6 is to sho how complex is ih task facing lernrs 
and users of ritn english. Th variations in ih spelng of /k/ ar eseniialy 
unpredictlM, wich means that, despite recurent patrns givng som gidance. 



W7iting Systems in Different Languages 

Table 5.6: Morfofonenvc variation invofvng A/ m english speing 

Ar /k/ spelngs morfofonennicly stable in english? 



C or CH? acrostic/hemistich 
C or CK? beacon/beckon, blocA)lock, panic/panicked, 

hammock/havoc 

C or K? cat/kitten, cow/kine, curb/kerb, 

provocation/provoke, disc/disk, Celt/Kelt, 
sceptic/skeptic (US) 

C or QU? licorice/liquorice 

CH or K? autarchy/autarky, feuchaemia/feukaemia 

CK or QU? check {\JS)/cheque 

CK or CQU? lackey/lacquey, racket/racquet 

K or QU? embark/barque, mask/masque 



ih spelng for each word containng th sound /k/ has to be lerni 
individuly. Yet /k/ is only one of forty or so sounds that make up ih 
fonolojy of english, and th variations shown abov only concern spelng, 
i.e., th process of riting. Th process of readng involvs perhaps even iiior 
complex variations, including for instnce, stress-asynmnt by th readr 
wich is not shown by th spelng at al, as wen a present/to present hav 
th same spelngs (Cut Spelng distinguishs presnt /present). Few othr 
languajs tolrate a riting systm that is remotely as unpredictbl as english. 

Th most importnt mesaj for teachrs is that they shud not be satisfyd 
with th presnt spelng systm. Wenevr they obser\' lernrs havng dificlty, 
they wud do wel to ask wat is causing th dificlt\'. By noting and classifyng 
exampls of dificlty, teachrs wil both com to apreciate th problms of th 
systm betr and be betr able to help iher i:)upils. They wil becom aware 
that redundnt letrs giv rise to a majority of errs, and may decide to tr>- 
spelng pronunciation as a device for helpng lernrs memrize dificlt spelngs 
(as suje.std b\' Drake and Ehri, 19S4), or they may develop othr tecniqes 
of iher own. 

Howevr, in th k)ng term, ther can be no othr solution but to work 
for til modrnizatk)n, .simplification and regulri;^ation of th spelng systm 
itself This is a matr hos ramifications mosll>' lie outside th scope of 
individul teachrs, tho they may hav som limitd oprtunitis for action; for 
instnce. teachrs may encuraj pupils to use regulr americn spelngs in 
prefrnce to iregulr british equivlnts (mor daringl>', one formr teachr 
nown to th authr for many years delibratly taut sinprizv in prefrnce to 
stipppise, but unconirold individul initiativs of this kind wud be a recipe 
for even gretr caos). Mor jenrly, one initiativ teachrs cud take is to 
discuss among themselvs wat reformd spelngs they wud juj most useful 
for th lernng process, develop a consensus on prioritis, and put forvs rd 
Avanld lists' thru ther professionl l*K)dis for considration by th aproprial 
aulhoritis. It was noticebl that th primary ycool teachrs union in France 
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thus suportd ih 1990 french speing refomis (Baddeiey, 1993). For leachrs, 
mor than anyone, stand to gain iniediatly from any esing of th task of 
litracy aquisition. 

Th Cut Speing in wich this chaptr is ritn is an exampl of th kind 
of experinintation and inovation that is now needd. Unhke som erhr 
proposals for th root and branch respelng of english from first principls, 
CS takes th presnt speing as its startng point. Its procedur is chiefly th 
paring down of overelabrat traditionl spelngs by removing redundnt 
letrs, i.e., those that ber an unpredictbl relationship to th pronunciation. 
This not merely brings advantajs of econmy (riting becoms faslr and 
takes up less spac^), but it targets th most serius dificlty of convenlionl 
english speing. It is dificlt enuf wen non-standrd letrs ar used to rep- 
resent sounds (such as th .\ in CDiw h in prett\\ v in busy, etc), but lernrs 
face a dubl dificlty with letrs u'ich do not represent sounds at al: in 
these cases, not merely must th corect sound valu of th letr and th 
corect letr for th sound be lernt, but so must th letrs position in th word. 
That is \\y, in a word like business, th chief problm is not th aberanl 
r, but th redundnt i, Teachrs wil find it helpful to identify how many 
of th problms ther pupils face arise from such redundncy, and considr 
wat tecniqes (e.g.. speing pronunciation) may be most helpful in 
overcomng them, 

Aftr som decades wen it was rathr neglectd in education, english 
speing is now receving new atention. aroused by reports of declining 
litracy standrds. At th same time, reserch into readng processes is 
incresingK' confirmng th iniporince of fonics for th lernng process.- By 
th lojic of this chaptr, th contradictions between th dificlty of english 
speing. th need for improved standrds of litracy, and th aplicaiion of 
fonics (fonics onl\* works proprly wen speing represents pronunciation ) 
shud be evidnt, and iead on to considration of ways of resoK ng them. 
Th time may now be ripe for action, as it has perhaps nevr l^een befor. 
but as yet th necesr\' orgnizationl structurs do not exist. Th minute, but 
intrnationly based. Simplified Spelling Society, is only a jinjr group — 
it can ajitate for action but it canot itself' introduce chanje.' A brodr 
groundswel of infoniid and influential opinion is needd. Perhaps a 
gretr undrstandng of th relationship between riting systms and litracy 
standrds. such as this cliaptr has atemtd to create, wil help jenrate that 
groimdswel. 

Notes 

1. l-or cxmnpl. JaiiU's A.I I Murray. Tusl cdili of ih Oxjhnl Ffti^hsh nk ti(i>uin\ 
WW ske.ii. ctnnnl scolt of Old aiul Midi English aiul tiisi pivsidnt of ih 
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Simplified Spelling Society; Daniel Jones, Harold Orton (1948), as revisers 
of th Netv Spelling proposal for englisli spelng reform in its 6th edition^ 
London: Sir Isaac Pitman and Sons, Ltd. for the Simplified Spelling Society. 
2. For exampl, th work of Linnea Khri (Cit>' Liniversity of New York) over 
many years; se also Vincent Connelly. University' of St Andrews, doctoral 
thesis in progress. 

3- Simplified Spelling Society. Secretary Bob Brown, 133 John Trundle Court. 
Barbican. London EC2Y 8DJ, Tel 071-628 5876, Fax 071-628 9147. 
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Summary 

A description is given of *Kqual-Fliis\ a new initial-leaching 
orthography currently being developed. This ()rtlK)graph>' bases 
its rationale on the sequence of competencies through which 
children are now known to pass in learning to read and spell. 
It makes use of such reguhuity as already exists in conventional 
Knglish spelling and minimizes the need for 'unlearning . Refer- 
ence is made to pilot studies which show that the system is easy 
to learn. Fvidence is given wiiich suggests that its early use 
might fiicilitate reading imd writing in conventkmal orthography. 



Introduction 

C.hildren seem to recogni/e their first few written words through sek'c- 
ti\ e ass()ciatk)n (Ciough. juel and Griffith. 1992). They will select for a 
word some arbitnny visual attribute which distinguishes it from words 
previously noticed. For instimce they may notice the shape at the 
lieginning of a name (Razwana). the matching 'ee* in 'bee*, tiie double 
hump Lit the beginning of \\kinf or the tail on the end of donkey*. 

The trouble is that reading by selective association, or *visuLil-cue 
reading* lis it sometimes called, nipidly incrcLises in difficulty as more 
and more words luv encounteretl until soon further progress clui only 
be made with herculcLm effort. Typically, there is a deteriorLition in the 
abilit\' to IcLirn to read new woreis (e.g.. Byrne. lTeelx)dy and Ckaes. 
1992). Hence Hertelson ( !9S(>) descrii>es this initiLil method of reading 
as 'non-productive'. In default of other stnitegies. children may persist 
with it for li long time: Ixit to l)ecome skillcLl rciulL^rs. tliey need lo 
inlernLiii/e the system i»y which elements of the written language 
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(letters and letter sequences) are mapped onto elements of the spoken 
language (phonemes and phonemic sequences). Without transition 
to a type of reading which makes use of correspondences between 
symbols and sounds — a change to what is called by Gough, Juel and 
Griffith (1992) cipher xf^^^dm^. by Chall (1979, 1983) Stage 1 reading (as 
opposed to Stage 0 reading) and by Frith (198S) alphabetic reading — 
there is a real danger they will end up reading extremely poorly if at 



How though is this transition best brought about? Ehri (1987) has 
elegantly demonstrated through a series of experiments the extreme 
implausibility of the argument that exposure to en\'ironmental print 
itself leads children to make letter-phoneme links (See Volume 1, 
Chapter 1). The dramatic differences which can be achieved through 
phonological-awareness training ha\'e been demonstrated by many in- 
vestigators (e.g.. Stuart and Masterson, 1992), and there has been much 
work showing how such training can be combined with the teaching 
of letter sounds (e.g.. Bryant and Bradley, 1985: Byrne and Fielding- 
Barnsley, 1989: Byrne. 1992: Stuaa. 1993) and how code-based pro- 
grammes can be integrated with teaching to improx e comprehension 
skills (e.g., Moran and Calfee. 1993). What has receiwd relatively little 
recent attention hoN\'e\'er is the possibility of modifying the English 
orthography used by learners so as to make learning to read and write 
easier (see Chapter S). 

This is the aim of Fxiual-Plus, the new initial-teaching orthograpiiy 
the writer is currently de\'ising. There are four sui')sidiary ambitic^ns: 

1. to improve phonological awareness: 

2. to impro\'e children's competence in matching written words 
and \\ rillen-word elements with words and word elements in 
their spoken lexicon: 

3. to facilitate the learning of the main phonographic correspond- 
ences of our present spelling system: and 

L to provide information abtuit the general sirucuire of this system. 

Fcjual-Plus bases its rationale on the secjucnce of competences 
through which children arc now known to pass in learning to read and 
spell (Read. 19"!. UrS, 108(>: Sterling. U)S3; c:atatdo and I^llis. I WO: 
Fhri, IWI: Schlagal. 1W2: Freiman. 1993). Flxploiting such regularities 
as already exist in traditional orthography (TO) (Furies. I0()3: \'eiie/ky. 
Uro: Albrow. I^)*^2: Finks/. 19*^3: fleniy. 1988: Carney. 199 U. Vx\m\- 
Flus minimizes the need lor unlearning'. Tnlikc the hiilial Teaching 
Alphabet (i.l.a), it draws attention lo these regularities. 'Fhere is thus a 
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real prospect thai any earl>' advantages it might have would be retained 
on transition to conventional orthography. Those of i.t.a tended to be 
lost (Warburton and Southgate, 1969). 

Adults literate in English can read Equal-Plus (EP) at sight without 
any prior instaiction. Here for instance is a specimen text taken from 
a well-known children's book written at the turn of the centur\': 

Edith Nesbit (1906) We Stofj of the Amu=let Lund+n. T Fish+r 
Unwin. 



And now from the windo= ov a for we=ld cab the Kwe=n ov 
Babil+n beheld the vvund+rs ov Lund+n. Bucking+m Palacvshe= 
thaut unintresting; Westminst+r Aby and the Houzes ov 
Parl+m+nt lit+1 bet+r. But she= li=kt the Tow+r, and the Riv+r. 
and the ships fild her with wund+r and deli=t. 

But how badl>' u= ke=p yor sla=vs. How rechid and por 
and neglected tha= se=m\ she= sed. az the cab rat+ld along the 
Mi=l End Ro=d. 

Tha= arn t sla=vs; tha re werking pe=p+r. sed Ja=n. 

'Ov covse tha're werking pe=p+l. That's wot sla=vs ar, Du: 
u= supo=z I do=n*t no= a sla=v\s fa=ce wen 1= se= it? Wi= 
do=n t thair mast+rs se= tha're bet+r fed and bet+r ck)=thd? Tel 
me= in thre= werds,' 

Xo= wun ans+rd. The wa=j sist+ni o\' iiiod+rn Ingl+nd iz 
a lit+1 dific+ll to expla=ri in thre= werds e=v+n if u= und+rstand 
it — wich the childr+n did+n't. 

'r=*l hav a revo=lt of yor sla=vs if yor not carefu_r. sed the 
Kwe=n. 

'0= n()=\ sed Sirr+1: u= se= tha= hav V()=ts — that ma=ks 
al the difr+nce. Fath+r t()=ld me= s()=/ 

AVot iz this vo^t?' askt the Kwe=n, Iz it a charm? Wot du: 
lha= du: with it?' 

*I= d()=n't no='. sed the harasst Sirr+i: it's just a v()=t. that's 
aL! Tha= d()=n't du: enything p+rticu=l+r with it/ 

1= se='. sed the Kwe=n: *a sort ov pia= thing. \Xel, I wish 
that al the=z sla=vs ma= hav in thair hands this m()=in+nt thair 
fil ov thair fa=vrit me=t and drink/ 

Inst+ntly ai the pe=p+l in the Mi=l End R()=d. and in ai the 
uth+r stre=ts wair the por pe=p+l liv. found thair liands ful of 
things to e=t and drink. From tlie cat") windo=s cud be= se=n 
pers+ns carrying evry ki=ncl ov food, and bot+ls and cans a/ 
wel. R()=si me=t. fowls, reel lobst+rs. gra=t yelo=y crabs. fri=d 
fish, boild pork. bc=t sta=k pudings. ba=kt uny+ns. mut+n pi=s: 
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mo=st ov the yung pe=p+l had oranjes and s\ve=ts and ca=k. It 
ma=d an enornius clianj in the look of the Mi=l End Ro=d — 
bri=t+nd it up so= to spe=k, and bri=t+nd up mor than u= can 
imajin the faces ov the pe=p+l. 

'Ma=ks a difr+nce duz+n't it?\ sed the Kwe=n. 

The Main Characteristics of Equal-Plus 

Equal-Plus is greatly indebted to Regukmzed English (Wijk, 1959, 1969) 
and Cut Spelling (Upward, 1991) but is \er\^ different from both. Its 
main characteristics will probably be fairiy clear from the extract. They 
can be set out ver\' simply by making a few comparisons with Tradi- 
tional Orthography (TO). We can concentrate first on stressed syllables. 

Stressed Syllahles 

I. Within the set of words containing 'short \*ower sounds it is easy 
enough to identify a phonographically 'regular subset. This can be 
arranged by onset, the initial consonant element if any. and rime, the 
sound which remains, in the form: 



it ... — ... — ... 
hil . . . bet ... — ... but 
— ... — ... cot . . . cut 

etc. 

or alternatively can be arranged: 

bat cat 

bag can 
ban cad 

cap etc. 

V.V consi.stentiy represents tiie.se five vowel sounds by tiie letters <a, e. 
i. o, u> and it simplifies consonant representation so that for the most 
part, there is a one-to-one correspondence between consonant pho- 
nemes and consonant letters. For instance in the \esbit text, TO <have, 
as, said, when, is. of. what, does, one> become FF <hav. a/, sed, i/, ow 
wot. du/, wun>. Other representations include <plad, bad; sez, lied. 



at 
bat 
cat 
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frend; giv, six*, bild, niith; woz, cof, yot; cum. sum, mf, blud, >\ing> for 
TO <plaid, bade: says, head, friend; gix e. sieve, build. mNih; was. cough, 
yacht; come, some, rough, blood, young> and EP <sent, scl, jem. brij. 
ren, not, Fil> for TO <cent, cell, gem, bridge, wren, knot, Phil>. 

The EP vowel spellings shown here are the ones mo.st commonly 
used in TO for their respective sounds. They are also the ones used 
most frequently by young children in both their correct and incorrect 
spellings (Treiman, 1993). For children entering the alphabetic stage of 
spelling and reading, the consonant representations have a similar 
naturalness (Frith, 1985V At this caicial transition, preliminary studies 
show that the words generated by the specific augmentation of TO 
regularity illustrated are far more likely to be read correctly than their 
TO equivalents and that the spellings chosen are similarly more likely 
to be elicited than their TO equivalents. The effect sizes are large and 
do not seem to be influenced either by school or by accent. 

In many accents, such as Received Pronunciation (RP) and General 
American, a distinction is made between the vowel sound in 'putt, but* 
and the vowel sound in put, push'. In EP. so as to reduce the possi- 
bility of confusion, instances of this second sound are given underlined 
letters: 

<put, pull, push, cud (could), wud (would), shud (should), pul 
(pull), ful (full)>. The <oo> spelling of this sound is one which 
children find easy to learn. This is retained for all its TO in- 
stances and is also underlined: 
<book, took, good. wood>. 

1. The spelling paradigm for stressed syllables containing short' 
vowels is already fairly regular in traditional orthography. All that is 
needed is a little tidying up to make it user-friendly. In TO. the next 
most frequent pattern for stressed syllables is for obvious reasons often 
displayed in contrast to the first: 

mat, mate Tim. time slop, slope cub. cube 
scrap, scrape strip, stripe cod, code cut, cute etc. 

(If we confine ourselves to single-syllable words, there are of course 
few pairs such a.^ 'pel. Pete.) 

Token counts derived by aggregating the fre(|uencies of running 
words (e.g.. I)ewe\ 19S(): Kucera and Francis. 19(v; Reid. 1989) con- 
sistently show that the most fre(|uent representation for what might he 
labelled 'letter-name long vowels* is vowel letter, consonant letter, e (or 
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in polysyllables some other vowel letter). Unfortunately, in addition to 
using the convention exemplified above, TO also deploys a great range 
of tiresome vowel digraphs. Furthemiore, letter-name long vowels' may 
be indicated with varying degrees of probability by particular conso- 
nant sequences (e.g.. Id: cold, told: st: most, post). 

Choice of representation in EP was guided by developmental stud- 
ies of children's spelling (Read, 1986; Treiman, 1993). Inicially, children 
tend to represent these vowel sounds by the letters which have names 
corresponding to them. They subsequently begin to add vowel letters 
which may or may not be correct as markers, so <cake> might be 
written first <cak> and then <caik> and <boai> first <bot> and then 
<bote>. Equal-plus uses the letters which would come naturally to young 
children and then adds an equal sign as a marker. 

<may, made/maid, mate, way. wait/weight, wave, waved, pay, 

paid, pain/pane, great/gratc> are thus written: 

<ma=. ma=d, ma=t, wa=, wa=t, wa=v, wa=vd, pa=, pa=d, pa=n, 

gra=t>. 

<see/sea, seen/scene, seat, me, meet/meat, these> are written: 
<se=, se=n, .se=t, me=, me=t, the=7.> 

<ciy, cried, m\\ mine, might, pie, pine, pint> are written: 
<cri=, cri=d, mi=, mi=n, mi=t, pi=, pi=n, pi=nt> 

<no/kn(>w, known, go, goat, ghost, grow, grown/ groan, bone, 
boast, most, though> are written: 

<n()=, n()=n, g()=, go=t, go=st, gro=, gro=n, bo=n. bo=st, mo=st, 
tho=> 

<cute, cube> are written: 
<cu=t, cu=b> 

Pilot studies indicate that this convention is easy to learn. In only a few 
minutes, 7-year-old children with learning difficulties who had just 
learned to read and write phonologically regular consonanl-\'owel- 
coasonant (CVC) words (cat, dog, tap, tip, top etc.) achieved a high de- 
gree of success reading and spelling words like the ones illustrated. It 
is anticipated that use of the \*(iuar sign will facilitate perception of 
differences and similarities in the sound structure of words. Shifting the 
final consonant letter to the end of the word should increase use of 
information about its sound. Regularizing onset and rime units can be 
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Table 6.1: Stressed vowel sounds in Traditional Orthography 



Stressed Vowel Sounds 



ar: 


car, dark, charm, sharp, smart 


or: 


or, corn, pork, torch, short 


au: 


haunt, fraud, Paui, jaunt, taut 


aw: 


saw. raw. Jaws, drawn, craw! 


er 


her. term, jerk, perch, stern 


ur: 


fur. burn. hurt. surf, burst 


ir: 


girl, first, third, skirt, birth 


ou. 


Out. cloud, found, count, couch 


ow: 


now, cow, brown, owl, crowd 


oi: 


oil, boil, join, joint, moist 


ov 


boy, toy, |oy. Roy, Troy 


air: 


air, fair, chair, hair, stairs 


are 


bare, care, dare, share, spare 


ear: 


ear, hear, dear, near, fear 


eer 


deer, beer, jeer, peer, steer 



expected to make these more salient and to fac ilitate use of the kind of 
analogy strategies discussed hy Gosw ami and Biyant ( 1990). Decoding 
strategies in reading and encoding strategies in spelling should be much 
easier to establish. 

There is a minor problem with the representation <u=>. For a 
number of words such as tube", there are differences between British 
and American pronunciations: Americans pronounce the vowel sound 

ii: while the British pronunciation tends to be - ju: . By adopting <u:> 
for all spellings of , u: other than those represented in TO by the letters 
<oo>. significant discrepancies are a\'(^ided: the two signs <u=> and 
<u:> look ver\' similar and the use for a few do/en words of one rather 
than the other should not create problems. 

<oo> for . u: is a veiy simple spelling. It is learnt at such an early 
stage that it would be counter-productive not to use it in FF for words 
which in TO have this spelling for the sound e.g. <food, boot: groove. 
FF groov>. For other words such as <blue/blew. through threw, grew. 
June. Judy. rude. true, truth, truly. tomb>. the representation <u:> ac- 
cords better with the predominant TO representations and is probably 
preferable: <blu:. thru:, gru:. Ju:n. Ju:dy ru;d. tru;. tru:tti. tru:ly iu:m>. 
<y> is used for the sound at the end of <Ju:dy. tru:ly>. In most accents, 
this sound ckxsely approximates that of letter name long e'. 

TraditKMial orthography has a number of other major representa- 
tions for stressed vowel sounds. See Table ().l. The TO representations 

in Table (rl are retained and supplemented in 1*.F by: 
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a: grass, fast, bath, can't, dance 

Two further units are used: 

war : war, warn, ward, warm, swanii 
al: al, bal, tai, wai, siiiai 

The spellings <ss> in <grass> and <ce> in <dance> will be noted. 
The possibility of consistently representing the sound /s/ by <s> and 
the sound /z/ h\ <7> was explored: but the arguments for retaining <s> 
for regular noun and verb inflections irrespective of Whether they are 
voiced <sits, mns; cats, dogs> seemed overwhelming. This orthographic 
convention is learned ver)- early (Atkinson, 1985). To violate it would 
cause great confusion to children who fail to voice certain letters owing 
to immaturity and would lead to imnecessarj* puzzlement in particular 
language communities. Most London children for example pronoimce 
<paths> as /parfs/ so spelling the final sound with the letter <z> would ' 
simply impose an additional burden. Similarly, it seemed sensible to 
make a clear distinction in the orthography between singular words 
and plural words. In addition to <s>, three other TO spellings of /s/ 
have therefore been retained: <ss>, <ce> and <se>. However, the last 
two are underlined to indicate that they represent single sounds. Hence 
children are discouraged from pronouncing <horNC'> 'whores' or horsey', 
from imagining that <pence> means writing instruments <pens> and 
from thinking that <the prince> denotes more than one exotically titled 
personage (prin + s). 

If \\ ords has e \*owel sounds which correspond to the ones above 
but has'e different TO spellings, these spellings are changed to accord 
with the predominant patterns so as to facilitate the learning of the 
major phonographic representations e g: 

<m()re store, door, floor, court, four, oar, li()ard> nrc written: 
<mor, stor, dor, flor, coH, for, or, bord> 

<caught, taught, ougiit, bought, talk, walk> are written: 
<caut, taut, aut, Ixiut, tauk, wauk> 

<W()rm, word, work, eartii, earn, learn> are written: 
<werni, werd, werk, erth, ein, lern> 

<bear, w here, there, their, mayor> are written: 
<l'>air, wair, tliair, mair> 
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Table 6.2: Representation of syllabic consonants m Nesbit text 


Words in TO 


Words in EP 


Words in TO 


Words in EP 




<+n> 






London 


Lund+n 


Buckingham 


Bucking+m 


Babylon 


Babil+n 


system 


sist+m 


England 


Ingl + nd 


Parliament 


Pari +m +nt 


even 


e=v+n 






children 


childr+n 






didn't 


did+n't 






difference 


difr+nce 






brightened 


bri=t+nd 






moment 


mo=m+nt 




• 


doesn't 


duz+nt 






onion 








mutton 


mut+n 






Words in TO 


Words in EP 


Words in TO 


Words in EP 








<+r> 


little 


ht+l 


better 


bet+r 


rattled 


rat + id 


river 


riv+r 


people 


pe=p+l 


Tower 


Tow+r 


difficult 


dific+lt 


wonder 


wund+r 


bottle 


bot+I 


master 


mast+r 


Cyri! 


Sirr+I 


lobster 


lobst+r 






particular 


p+rticu=l+r 



Some minor distinctions in pronunciation which might bt^ made by the 
speakers of particular accents ha\*e been ignored. The aim is to link 
elements within written words to elements within spoken words in a 
manner which is generally useful to all speakers of English who are 
learning to read and write the language. There is no attempt to provide 
a phonetic transcription of any particular accent. 

I 'nstrcsscd Syllahlcs 

Two main simplifications are made to the representation of unstressed 
syllables. 

1. When we say the word button', there is no vowel sound be- 
tween the last two consonants. The peak of the final syllable is located 
in the fitial consonatit which would phonetically be written inl (Kreidner. 
\^)H^)). Syllabic n. m. I and r are consistently represented in VV <+n. +m. 
+1. +r>. Examples from from ihe Nesbit text are given in Table (i.2. 
^'oung children typically omit vowel letters when they wiite words 
containing syllabic consonants. Spellings encountered recently included: 
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Table 6.3: Syllabic consonant representation as a source of uncertainty 



WnrHc in TO 

VVUlUo 111 * \J 


\A/nfrl«; in FP 

V VUI Uo 111 L r 


Words in TO 


Words in EP 










heaven 


hev+n 






Devon 


Dev+n 


phantonn 


fant+m 


seven 


sev+n 


bantann 


bant+m 


Dillon 


Dil+n 






villai n 


vil+n 






vixen 








N ixon 


Nix+n 














lt;o^d 1 


Ipcc x n 






dozen 


duz+ n 






^UUo II 1 


cuz+n 






Wnrrl«; in TO 

V VUI Uo II I » \J 


Worrl<; in EP 

V V t^l uo III 1-1 


Words in TO 


Words in EP 




•fl 






kettle 


ket+l 


teacher 


te=ch+r 


nnetal 


nnet+l 


vicar 


vic+r 


model 


nnod+l 


doctor 


doct+r 


waddle 


wod+l 


martyr 


mart+r 


synnbol 


sinnb+l 


chauffeur 


sho=f+r 


nninnble 


nninnb+l 


mirror 


mirr+r 


chisel 


chiz+l 


Cheshire 


Chesh+r 


sizzle 


siz+l 


injure 


inj+r 


sandal 


sand+i 


Gloucester 


G!ost+r 


handle 


hand+l 






naval 


na=v+l 


metre* 


me=t+r 


navel 


na=v+i 


labour* 


la=b+r 


principle 


prinsip+l 


neighbour* 


na=b+r 


principal 


prinsip+l 







Note: * British spellings 



DOKTR <d()Ct()i\ FF d()ct+r>. DIXR <dinner, EP din+r>, 
JASN<Ias()n, 1'":F Ja=s+n>, PARSL <parcel. EF i:)ars+l>. BOTL <tx)t- 
lle. EF h()t+l> and HATNKhottoin, EF lx)t+ni>. 

At a later .stage there is uncertainly about which N'owel letter should he 
written and soiiietinies also uncertainty as to where the \'owel letter 
should be placed in relation to the consonant letter. Words illustrative 
of the problems confronting learners of TO are shown in lahle 0.3 
together with their EP ecjuivalents. 

Iu|iuil-Flus attempts to postpone such uncertainties until the basic 
orthographic structures have been accjuired. Transformation into EF 
seems to make words containing syllabic consonants considerably easier 
to read and write. In an informal preliminary' investigation, it was found 
that children of 6 and with learning difficulties who had just learned 
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lo read and write regular CVC words were able to read and write many 
of the EP words with less than five minutes practice using the <+n, +ni, 
+1, +r> units. With a corresponding random selection of the TO words, 
the only success achieved by any child in a matched group was in 
reading <teacher> and <doctor>. There was a marked tendency for 
children in the TO group to place the stress of some of the words on 
the wrong syllable. 

2. A number of soimds, of which /f/ as in <ship, brush, station, 
pension> is the most obvious example, tend to have different spellings 
in stressed and unstressed syllables. It is important that some of the 
main spelling patterns of longer words receive early aUention, for if 
children imagine that TO is a monosystemic representation of the sounds 
of the language, the\' are likely to experience considerable delay in 
learning to read and write w(uds which are polysyllal)ic. The units <ti, 
si. ssi, ci.> are therefore used in FP for the /s/ sound in words such as: 

menii+n, pensi+n, missi+n. majici+n. 

The consonant sound /3/ heard at the beginning of <Zhivago>, at 
the end of <rouge>, in the middle of <treasure> and at the start of the 
third syllable of <occasionally> also needs t(^ be represented from time 
to time. The choice of spelling is not easy because in TO no one 
pattern predominates. Perhaps it might be best to have a special unit 
sion for v/ords such as: 

vision, television. confu= sion . conclu: si()n . perswa= si()n . 
inva= sion 

could then be u.sed for words wliich already contain these letters 
such as: 

mesu+r 

and </h> could be used for what remains such as: 
ru:/h. 

One further representation is needed, a special unit for the ending 
<ture> which is fairK' fre(iuent in the texts young children read and 
write. This is written <lure>: 



pic ture , na=lure, cre= ture , fu=-lure, adv enture, tor ture , arkitecture 
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Tlie in\'enlor>* for unstressed syllables can now be completed. Schwa 
vowels, e.g., the .sounds repre.sented by the <a> in <attend, EP atend> 
and the <u> in <SLipport. EP SLiport>. are almost invariably represented 
by the .same letters which are used in their TO spellings unless they are 
covered by the <+n, +m, +1. +r> convention. 

Messages for Teachers 

Equal-Plus attempts to alleviate some of the burdens which an 
unreformed orthography places on those w ho are learning to read and 
write English. Its central features are a \'er\' simple method of represent- 
ing letter-name long vowels' and a ver\' simple method of representing 
.syllabic consonants. These provide a wide range of potential teaching 
.strategies. 

The.se features could be u.sed on their own. A .sy.stem which radi- 
cally simplifies English orthography would .still remain. Scarcely any 
unlearning would be recjuired: scarcely a single letter in a single word 
would need to be changed on transfer to traditional orthography. 
By extending cursive ligatures, to repre.sent the ecjual sign, by drawing 
a line acro.ss ligatures to repre.sent the plus sign and by putting a dot 
over ligatures for <u:> it would be easy to write all words in joined 
writing: 



ta=k. ta=king, ta=k+n. Pad+m+nt, cu=t. su:t, di=lu:l 

I-P could be u.sed in many other ways. C^hildren could u.se it in 
their writing for words they found difficult to .spell. Teachers could u.se 
for it for showing how the .same .sounds arc repre.sented differently in 
conlra.siing .sets of words or for showing how T(^ .sometimes represents 
different .sound.s, e.g., tho.sc corresponding to the tcn.se marker <ed>, 
with the .same letters .so as to give easily identifiable units of meaning. 
The plus .sy.stem could be extended to all \schwa* or 'neutral vowels'. 
It could be restricted to word.s children found difficult. But behind EP 
and all its potential variants is a .single idea: the notion that the main 
features of I-nglish orthography should ix* taught first and minor details 
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taught later. Take a particular word by way oF example. <Me=i> is suffi- 
cient to identify a word in the child's spoken lexicon. Ha\ ing already 
learnt many of the basic structures of English orthography, children can 
learn through subsequent word study the particular letter which is used 
as a marker. 

They might for instance be told a stor^' about the netball or football 
tcY/m of Ecfsl Acton School being taken to the smside as a special trcv/t 
by their tc^^/cher. The \ccim could ha\'e a mecA on the bcY/ch. They might 
ecn meca and bcY/ns and pcY/s and then fcY/st on pcY/ches and crcY/m. 
i:V/ch child might then be gi\'en either lea or ice crcYnii. Further incidents 
could easily be contrived. Children who were usually neai and tidy 
might get their clcYm \ecins dirty. Someone else might make a seal out 
of discarded clothes and be bitten l'>y flmv. In this way, a number of 
words with a particular sound could l')e memorably associated with a 
particular letter secjuence. One spelling would tend to reinforce another. 

Other sounci-symb(M correspondences might be taught ver>* differ- 
ently. For instance, older children might be given an historical explana- 
tion of the <o> spelling in <money. month, son> (Scragg, 19-4). The 
different TO realizations of <+n>. <darken, action, magician>, might be 
approached through imits of meaning. 

What is important is that all children accjuire at an early stage a 
basic soimd-symbol structure on which they can build. A disturbing 
outcome of the research into i.t.a. was the strong aptitude x treatment 
interactions. Downing (19()'') summarized \*ery fairly the results of his 
own early lesearches: 

The improvements in t.o. reading are most noticeal')le generally 
among the high achie\ ers — those who 'will learn anyway and. 
therefore, do not really need educational inncnations* accord- 
ing to .some British educators. The .slower learning children do 
tK'gin to .show .some tx^nefit from i.t.a at the end of the third 
year, l'>ut the poorest 10 per cent show negligible inipro\ ement 
in test results. (Downing. 1967, p. 29.^) 

Ixjual-Plus is much .simpler than i.t.a and. unlike i.t.a. it deliber- 
ately foregrounds TO regularities. It is anticipated that its effects on 
•slow learners' or those with .specific learning difficulties could usually 
be shown in a few weeks and its effect on high achie\ers in a few 
hours. The respon.se of young children with learning difficulties has 
been highly encouraging h\\[ sul')stantiai da.ssroom-ba.sed research, for- 
mal and informal, is now needed with a wider population. For this, 
there are two main recjuirements. 
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First, it is necessar>' to specify particular orthographic domains so 
that the effects of specific transformations on the reading and spelling 
of words having paiticular characteristics can be determined. For teach- 
ers who are used to recording the content, method and outcomes of 
their teaching, this strategy will appear fairly natural. It is also highly 
important. Fqual-plus is not a cut and dried system. Many details will 
no doubt be improved by modification. One needs to know which 
orthographic transformations are beneficial and to seek open-mindedly 
viable alternatives for those which are not. 

Second, careful attention needs to paid to the possible effects of 
teaching method. In the research reported by Downing and Jones (1%6) 
— commended by Warburton (Warburton and vSouthgate, 1969, p, 207) 
as 'the most efficiently designed research that has been carried out [into 
the effects of i.t.a,l and hence the most convincing' — five out of the 
twelve teachers said that they used phonic methods earlier with the i,t.a 
class than with the TO class. There were no differences in the opposite 
direction. vStudies consistently show that code-emphasizing approaches 
produce reading attainments which are superior to those associated 
with other approaches (Chall, 1979. 1983: Perfetti, 1991), As Warburton 
comments. There is no doubt that the t.o, children grew in inferior 
phonic soil' (Warburton and Southgate, 19(i9, p, 260). It is important 
that the effects of medium and method are not confounded. For those 
conducting large-scale research, calculating through analysis of \ari- 
ance separate effect sizes for each would be one possible strategy for 
avoiding this. 

An alternative — generally much more accessible to schools and 
preferable in many ways — is to ha\*e matched groups or subjects 
taught by a code-based approach in TO and FP, This possibility is 
feasible even within a single classroom, Howe\ er if the children are 
sometimes taught in withdrawal groups, control of factors influencing 
outcomes will probably be easier, (Massroom materials can then remain 
entirely in TO, The FP withdrawal groups can make extensive use of 
TO texts with interlinear FP transcriptions so that the potentialities of 
F.P as a phonographic code are fully exploited. They can be encour- 
aged to explore ways in which their speech maps onto TO, Thus trans- 
fer to traditional orthography, instead of being left to the end of the 
intervention, can be made an integral part of most of the reading and 
writing lessons. 

Whether such experiments are formal or informal, their results 
should lead to v aluable reflection on both medium and method. To 
achieve substantial improvements in standards of literacy, we may need 
to change both, 
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Conclusion 

Equal-Plus offers new ideas for teaching children the main structures of 
the English \\ riting system. It accords well with what is known about 
the successive competences in\'olved in learning to read and write. 
There seems a real prospect that its use will improve children's ortho- 
graphic understanding and that the development of both word recog- 
nition and spelling will be facilitated. This possibility, which would 
enable teachers to give greater emphasis to higher-order reading com- 
prehension skills and to the discourse-level skills needed for different 
kinds of writing, is susceptible to immediate investigation within the 
c lassroom. 
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Summary 

The nature of llie reading domain is discussed from the assess- 
ment point of \*ie\\' and a model is proposed w hich includes a 
range of contexts of performance, each pitched at a defined 
level of maturity. This is the differentiation by task approach to 
the assessment of reading in the National Curriculum. Empirical 
data from tw enty-five teachers in Northern Ireland are repc^ed 
to show that teachers w ho produce high-quality assessment w ith 
the tasks tailored to the individual child, are also teachers w hose 
class average-reading scores tend to he high. The implications 
for teaching and testing are discussed. 

Introduction 

The assessment of reading is an almost universal practice in schools 
(Gipps and Goldstein. 1983) and yet we have learned ver\* little from 
all this activity. Large cjue.stions remain unan.swered at both national 
and classroom le\els. Ila\e reading standards deteriorated (Turner, IWO)? 
What is the best teaching approach for this non-reader? When X'incent 
and de la Mare (198()) came to revise the individually administered 
Neale test. the\' could not find any systematic account c^f the diagnostic 
e.\pei1ise which they beliewd that teachers must ha\'e developed in 
using the test. There is no point in a.ssessing reading unless the test 
generates usable, and useful, results. C'leady the issue of utility, the 
guiding principle of test construction, is a real one in relation to the 
assessment of reading. 
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The issue hiis been sharpened h>' the iniroduelion of National 
Curriculum Assessment, itself an uneasy compromise between compet- 
ing approaches to the improvement of teaching and learning. The Task 
Group on Assessment and Testing adx'ocated formative assessment as 
the major engine of curriculum change (DFS, 1988), It took the view 
that direct assessment of pupils by their teacher would sensitize the 
teacher to what the pupils had made of the teacher's intentions and 
create a feedback loop for the improx ement of performance of both 
parties. If national-assessment instruments were developed as tools for 
teachers to use in obtaining this feedback, every classroom teacher 
would become a potential action researcher and the ([uality of educa- 
tion would be raised. But there is a competing model for educational 
improvement. The publication of league tables of school performance 
combined with greater freedom of parental choice of schools puts the 
parent, rather than the teacher, in the position of action researcher and 
l\rings the evaluative potential of National Curriculum Assessment to 
the fore. The improvement of teaching and learning by closing down 
uiiderachie\'ing schools Lmd weeding out ineffective teachers calls for 
a very different kind of assessment tool. If schools are to be ex'aluated 
by the performance of their pupils, it does not make sense to involve 
teachers too deeply in the determination of lewis of attainment. 

The tension between formati\'e and evaluative functions of assess- 
ment is not confined to Britain. In the Tnited States it is described as 
the tension between instructional assessment (Wiggins, 1989) and ac- 
countability assessment (Mehrens. 1992). Nor is it simply a debate about 
purposes. If fitness for purpose is to inform the design of assessment, 
tliese competing approaches lead to entirely different specifications for 
effective reading tests. 

Defining the Reading Domain 

The problem of assessing reading is to determine along whicli dimen- 
sion assessment should take place. Some meims must be found ot 
honouring the multiplicity of contexts in which reading is demanded, 
the range of strategies and competences that the reader may bring to 
bear and the efforts of teachers to extend that rimge to ever more 
reading contexts and ever more children while keeping this complex 
interplay within bounds. iMgure 7.1 schematizes these three factors. It 
only one is chosen any variation in the other two is essentially ignored. 
It certainly goes unrecorded. All observations which vary along otiier 
than the chosen dimension are treated for measurement purposes as 
repetitions of the main dimension (Cattell, |9()()). 
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Figure 7. ): Potential dimensions in the measurement of reading 



Maturity 




Skills and Contexts of 

competences perfornnance 

Each of the main dimensions in Figure 7.1 is itself an amalgam of 
potentially independent dimensions. The reading context could readily 
be subdivided into the linguistic difficulty of the text, the genre to 
which it belongs, the purpose for which it is being read, the format in 
which the reader's response is to be recorded and the rater's stance, 
whether participant or remote, but all of these variables can be sub- 
sumed as elements of the context of performance. Contexts of perform- 
ance var\' along a dimension of task authenticity. 

Table 7 1: From state intervention to the revised code, 1862, page 259 

Reading Standard I Narrative monosyllables 

Standard li One of the narratives next in order after monosyllables in 

an elementary reading book used in the school 
Standard III A short paragraph from an elementary reading book used 

in the school 

Standard IV A short paragraph from a more advanced readmg book 

used in the school 
Standard V A few lines of poetry from a reading book used in the 

first class of the school 
Standard VI A short ordinary paragraph in a newspaper, or other 

modern narrative 



The earliest attempts at large-scale reading as.sessme!it were based 
on authentic classroom tasks. These were under teacher control even 
though this was high stakes* testing — salaries depended upon the 
outcome. As the summaiy statements of the State Interwntioii to the 
Revised Code of 1H62 indicate (see Table 7.1). reading was to be as- 
sessed by reference to a range of even' day reading tasks di,stinguished 
largely by the nature of the texts involved. Tiie 'standards' can be 
ceptualized as a set of cells moving diagonally upwards across the 
context-maturity facet of Figure 7.1. 

I'nfonunately, while this early attempt to define the reading do- 
main paid due attention to a diversity of texts, the skills and competences 
to be assessed were largely confined to recitation. The specification of 
performance was not informed by an analysis ol tlie reading construct. 
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The first National Cunriculuin in Fngland failed to specify comprehension 
as a component of reading skill. Generations of barking at print' ensued. 

The dimension of reading competences in Figure "".1. became the 
focus of early psychometric tests of reading (Bun. 192 h Vernon, 
Schonell and Schonell. 1942). These made little effort to describe the 
complexitv of contexts in which reading take> place, hi accordance 
with the abilities model, an opposite reductionist tendency held sway. 
The attempt was to identify the kernel, the single, pure representative 
.sample of the activity, which would predict its various manifestations. 
Operalional definitions of reading skills and competences became en- 
tirely circular, being provided only by the tests themselves. There was 
no discourse model against which to test construct \ alidity (Spolsky. 
1990). Te.sts which fractionate reading into skills create a considerable 
di.stLmce between the real performance and the various subtests of the 
instrument. Only the expert user is likely la appreciate the ettect on 
construct validilN' of selecting one .skill from the whole, .^s a result, 
although these early psychometric tests ha\e (out or five subtests a 
potential profile of competences w as frequently sacrificed in practice to 
a single surrog^ite measure of the maturity diniensic^n. the graded word- 
reading li.st. 

l^ie measurement model of the time, classical true sccmv theory* 
(see Cronbach cl cii. U)"'2. Chapter 2) was but pcK^rly responsive to 
mullifaceted accounts of learning and to the fact that indi\ idual chil- 
dren mav follow different routes to comparable le\'els oi attainment. 
The use of classical true score theor\' to evaluate te.sts of educational 
achiewment was alnu^si a categoiy errc^*. It treated achievement as a 
linear function of owe or nu^re abilities. The.se claims for the predictive 
\'alidil\' of .surrogate mea.sures robbed the maturity dimension ot its 
central pedLig(\i;ical impedance. The key tjuestions. what develops, in 
wliLil order if any. and why. were not addre.s.sed. 

I-igure "'.I sets out the three dimensions of the reading-a.s.sessmenl 
domain as essentially (MthogcMial to each cnher. While .scMUe contexts of 
performance will (vcur w ith greater frecjuency than others as the child 
de\elops. it is e.ssentiLilly the ca.se that any ccMitext of performance 
could occur at any le\el c^f maturity. Similad\'. altlunigh any cc^ntexi 
may make .salient demands on (Mie ox ancnher ccMupctcnce — the idea 
of primarx traits* which underlies the writing dcMuain in the .\merican 
Natic^nal Assessment (^f F.ducational Prc\i;ress — all ccMupetences ha\'e 
pcnential in all contexts. Young children are ju.si as much engaged with 
the ccMuplexities of the reading process as their colder ccninterparts 
(Clhittenden. 19S3). Finally, there are those who would claim that there 
are no linear hierarchies in children's devclopr 'Cnt of understanding as 
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ihey mature (Noss et aL, 1989). The context-maturity facet of Figure 7,1 
respects that position as a starting point in defining the reading domain. 
Models sucii as Figure 7. 1 ha\'e to be e\'aluated both in terms of 
how closely they represent reality and also in terms of the extent to 
which they meet the utilitarian purposes for which they are designed. 
The Torch tests (Mossenson et ai, 1987) carr>' all tlu hallmarks of 
Figure 7.1 as a representation of reality, Ele\'cn distinct types of skill 
(the competence dimension) are identified, ranging from copying com- 
plete sentences \'erbatim from the text to inferring emotion from a few 
scattered clues and the writer's lone. Furthermore there is a range of 
texts (the context dimension) for different degrees of maturity in reading. 
However, the performance context is much diminished, just as in 1862, 
by the use of only one mode of response, a variant of the clo/.e test. 
Heap s (1980) analysis of extraneous test-related rather than reading- 
related problems asscKiated with as,sessment concludes that only diver- 
sity in format can prevent cumulative and invalid deficit for some 
children. The instrument effect*, which reliance on a single mode of 
response inxites. creates an artificial concordance between all skills and 
all contexts. 

Instrument effect in the Torch scale is open to' funher reinforce- 
ment through the use of the Rasch ( 1960) model to select items for the 
test. The Rasch model assumes a single dimension linderlying the test 
(maturity in Figure "'.!) and further requires that all test items relate to 
the dimension in preciseK the same wav. Thus the inclusion of an item, 
and hence the operational characteristics of the dimension, are crucially 
dependent upon the characteristics of all other items in the test. Should 
an item not behave in the same way as the generality of other items, 
say it has a markedly higher correlation with the total score, then that 
item will be rejected as part of the scale. A Rasch scale therefore auto- 
matically excludes any task which is likely to create differentiation n ithiJi 
the reading performance of the individual child. It is precisely this 
within chiid differentiation which leads to different readers accessing 
text in different ways, the essence of Clay s ( 1988) 'Reading Reco\ er\" 
Program*. An essential diagnostic element is taken out of the test at the 
point of reading scale construction. Within child differentiation is as 
potent a threat t(^ the integrity of a Rasch scale as it is to internal 
consistency as a measure of reliability (Stanley, 1971). 

Item iiomc^geneity is the main Rasch criterion. It is a method of 
reading-test construdion designed to differentiate readers, not reading. 
In the case ot the Toix h scales, which are \'er\* long, ccn ering seven 
years of educational progress, an admirable heterogeneity of text and 
of skill demand is elided ihiougli a single item type towards a single 
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global score of reading ability. In the interests of manageability — the 
iililiiarian purpose — the multifaceted model of Figure 7,1 is collapsed 
into a single maturity dimension. 



Choosing a Measurement Model 

A single dimension, the testing objective of the 1920s, is disastrous from 
the pedagogical point of view. How can teachers respond to such a 
score? It is a single number and all the teacher can endeavour to do is 
to make it larger. In these circumstances teaching to the test is the only 
sensible strategy as Goodhait s Law demonstrates in economics, 
attempts to manipulate a predictive variable, rather than to manipulate 
the performance that the variable predicts, immediately destroy the ,sel 
of relationships which lent the variable predictixe validity in the first 
place. The mini-economy of many classrooms echoes our national 
economic plight. Ritualistic beluu iour is rife. 

If a one-dimensional model is ineffectual, a three-dimensional model 
would be unmanageable. It would be far from clear how to interpret 
scores in terms of a way forward for the child or class. We are left then 
with a choice between competing two-dimensional models as guides to 
test construction. Three different bases of argument all point to the 
context-maturit>' facet as that guide, Alexander ct al. ( 1991 ) systematize 
ox er 300 articles on the usage of knowledge terminology in studies of 
learning, FoUowing Flavell (19H7). they analyse metacognitive know- 
ledge into three major elements. 'Self-knowledge', how individuals 
percei\'e themsehes as thinkers and learners, grows from knowledge 
of one's own and others* ta.sk performance. Task-knowledge', how in- 
dividuals categorize the types of task that they encounter, grows from 
experience of tackling them. 'Strategic knowledge', the awareness that 
different procedures or processes may have to be initiated to permit 
more effccliw learning or thinking, grows from the other two types of 
knowledge (pp. 328-9 passim). The promotion of , such metacognition 
of the printed text and its pleasures is eveiy teacher s goal. It is pro- 
moled through encounters w ith authentic ta,sks. 

iienning and Ca.-scaller (1992) provide pragmatic grounds for se- 
lectini; the context-maturity facet. Their empirical study of the Ameri- 
can 'Educational Testing Service' (ETS) Test of English as a Foreign 
Language found such strong interactions between tested communica- 
tive competences and the social register and pragmatic futu'tioirs of the 
contexts of communication in which they were tested that the researchers 
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conclude that any test of conimunicative competence would he better 
to specify contexts of performance to ensure validity of assessment 
(j). 21). More expert readers (tiiese were potential university students) 
use task analysis to make strategic decisions aboiu how to approach a 
communicative task. As a result authentic tasks cannot be irsed to pro- 
vide direct assessment of separate and distinct skills. The phenomenon 
is now well attested in language testing even at tfie level of the indi- 
vidual item (Alderson. 1989). The pragmatic issue is whether, in the 
face of a profusion of competing and contradictor^' language models, 
each suggesting a different partitioning of the competence dimension. 
lk:jre is point in teasing out how any indiv idual has responded to any 
item which is not itself an authentic task. 

The third consideration is both ethical and educational. Messick 
( 1989) has argued for the past decade that the evaluation of both the 
intended and the unintended consecjuences of any testing is integral to 
the validation of test interpretation and use. Task-dri\ en performance 
assessment promises authentic and direct appraisals of educational 
achievement leading to positive consecjuences for teaching and learn- 
ing. Conslruct-driven assessment on the other hand must be held re- 
sp()nsil'>le in part for Cato ct al.'s (1992) finding that teachers lacked 
awareness of the different language demands of areas of the ci»rriculum 
and had difficulty in articulating both their methods and the view of 
language which underlay their choice of method (p, 10). in fact their 
classroom practice itself ofien reflected a greater variety of technicjues 
than they had specified in either questionnaire or interview (p. 25). it 
is an unintended consecjuence of our current approaches to the testing 
of reading that it deskills teachers in thinking about the teaching of 
reading. That is the real test bias (l-rederiksen, 198 0. 

National CAirriculum assessment offers the opportunity to redress 
the iralaiKV. The definition of reading is atheoretical. eclectic rather 
than focused, and describes a set of weak progressions — some, more, 
a lot inore — without analysing how they can Ix^ brought about. The 
utility of i'igure "^,1 is its implicit guidance that we lest that curriculum, 
not through a set of internally consistent competence constructs, but 
through authentic tasks, each free-standing and exemplifying a State- 
ment of Attainment, that is through context-dependent task perform- 
ance, W hether the Statements of Attainment are pitctied at the riglu 
lev el of specificity, or even ttie right level of attainment, whether thev 
are too many or too few. wtiether the aggregation rules make sense, is 
thereby rendered almost irrelevant in tlie short-term. "Hiese are all 
empirical cjuestions capable of empirical resolution against an educa- 
tional criterion; their efficac\' in guiding teaching endeavour. 
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Educational Effects of the Model in Action 

The Centre for Formative Assessment Studies (CPAS) in its pioneering 
work on the assessment of end of Key Stage 1 assessment in England 
and Wales (SEAC, 1990) developed the idea of a separate task to ex- 
emplify each Statement of Attainment. The consequent problem of 
manageability (there were 228 Statements of Attainment) was tackled 
by giving priority to informal teacher judgment. The teacher decided 
which tasks were well within the capacity of the child and only for- 
mally tested the area of proximal development (Moll, 1992): the teacher 
set only those tasks which were a potential challenge to the individual 
child. External statistical monitoring was used to validate the teacher's 
choice of task and the outcome of the task itself validated the teacher s 
judgment of the child's level. In essence the task was used to confirm 
the teacher's every^lay classroom obsen'ation and such confirmation 
was forthcoming in over 90 per cent of some 3,000 judgments made of 
reading. A modified form of the procedure in which the teacher's judg- 
ment of entr>' point had to be confirmed by all tasks at the appropriate 
level was adopted subsequently for use in all Key vStage 1 assessments 
of reading in England and Wales. 

CPAS was later approached b\' the Northern Ireland Schools Ex- 
amination and Assessment Council (NISEAC) to supply a reading test 
along these lines for its end of Key Stage 1 assessment which occurs at 
age H. The data which follow refer to the trial of that reading test. 

Method 

The test is in four major parts, each a set of reading tasks presented as 
a pupil reading book with table of contents and index as appropriate 
to the reading lewl. Lexeis 2 and 3 are in the same book with a defined 
break point between levels 2 and 3- Levels 1 and 4 are in separate 
books. In addition there is a separate reading-aloud passage which can 
be used to determine whether borderline children should start at level 
1 or go straight to the iex'el 2 and 3 book. There are thus five possible 
starting points which are under teacher control. The exit point is deter- 
mined by the child's performance but there is again an element of 
teacher decision as to whether the child should proceed. This interac- 
tive approach to the testing of reading was completely new to the 
twenty-five teachers from eleven primar>' schools who took part in the 
trial. They had a two-hour introductor>' session on how to use the test 
structure and on the scoring of outcomes. In the following eight 
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schooldays they tested between six and thirty-four children each, yield- 
ing a total pupil sample of 492 children for whom individual task out- 
comes are available. 

From these data two variables of immediate interest have been 
derived: the outcome level for each child and an index of assessment 
quality for each teacher. Outcome levels have been assigned using the 
n-1 aile, that is the child is awarded a level providing that attainment 
is demonstrated in all or all but one of the tasks at a level. There is a 
task for jach Statement of Attainment (SoA). However not all teachers 
completed all appropriate assessments of a child. Where no level out- 
come has been established the child has been assigned to the level 
immediately below the level assessed if there was attainment of one or 
more SoA at the level assessed and to two levels below the level as- 
sessed if there was no positive attainment at the level assessed. 

Assessment quality is on a three-point scale, with one penalty point 
subtracted from the three point maximum for each of the following 
circumstances. 

• -1 if no level is established for the child; 

• -1 if the child establishes a level but the teacher fails to con- 
firm the level although confirmation is possible, e,g., if a child 
completes all level 2 tasks successfully and is not then gi\en 
the opportunity to attempt level 3. If the child attained level 2 
with one missing attainment (the n-1 rule) that was taken as an 
indication that level 3 was not yet possible and the point was 
not deducted. If the child attained all level 4 SoA, the point was 
not deducted as there were no level S tasks available to use as 
confirmation. 

• -1 if the child was taken two levels beyond his or her achieved 
level, e.g., a child who attains level 1 should be tested at level 
2 but if level 2 is not attained should not be tested at level 3. 

Assessment cjuality has been scored for each child individually and then 
averaged over the pupils in a class to produce a teacher-assessment 
quality score. 

Results 

This was the first trial of the test, so the items were untried. To what 
extent then did they reflect the expected structure of the context- 
maturity facet? According to the National Curriculum there are a series 
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of contexts of performance for reading at each level but the scoring 
rules assume that there will he complete master)' of all the contexts of 
performance at a level before the next level of maturity is also attained. 
The alternative view would have very different levels of maturity in 
different contexts of performance. These are referred to as strands' in 
the National Curriculum documentation, 

A total of 380 children were tested at two or more levels of matu- 
rity. In only twenty cases did this untried test produce inconsistent 
results with a higher level attained while a lower level remained incom- 
plete. In ever>' case of inconsistency, level 3 with four tasks was achieved 
while level 2 with five contexts of performance remained incomplete. 
The index of level consistency is ,95, sufficiently high to allow for the 
exploration of assessment quality. The index of level consistency indi- 
cates the strength of the maturity axis of the context-maturity facet. 

The context of performance axis is much less well defined in the 
lest. There were forty-seven cases of positive attainment at two adjacent 
levels of the test without either level being achieved,, forty-five of those 
involving levels 2 and 3. As such cases arise from deficient assessment 
strategies on the part of the teachers concerned, their frequency is an 
underestimate of the potential for this occurrence and cannot in itself 
be taken as evidence of the strength or weakness of the contexts of 
performance axis. However examination of these forty-five cases pro- 
vided only weak evidence for two strands. In most cases the outcome 
appeared to arise from random errors on the children's part. 

Teachers adopted one or other of two .strategies in conducting the 
trial. Ten teachers used just one or two test levels and selected children 
to fit the test. From the point of view of assessment quality this was a 
poor strategy, yielding an accurate assessment of only 27 per cent of 
the ninety-seven children tested in this way. The alternative strategy 
involved complete classroom groups with the test form used tailored to 
the level of the individual cliild. This approach, adopted by fifteen 
teachers, yielded accurate assessments of 53 per cent of the 395 chil- 
dren involved. Within this group of teachers, however, assessment quality 
varied a great deal (mean = 2,29; s,d, = ,27) from 5(^ per cent totally 
accurate assessments to 77 per cent totally accurate assessments. When 
teacher-assessment quality was correlated with the mean reading level 
of the class (mean = 1XV\ s.d. = ,44) a significant positive relationship 
was found (r = ,48 with 13 d,f; p < ,0"^). This relationship is much 
stronger than the relatk)nship between reading level and the accuracy 
of assessment of the individual child (r = ,30; d,f, = 490; p < ,01) and 
cannot be attributed in its entirety to an artefact of the method of 
computing assessment cjuality. 
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Furthermore the class advantage in mean-reading level was not 
won by either homogenizing or differentiating the class group. The 
correlation between the within-class standard deviation (mean = 0,81; 
s.d, = 0,14) and assessment quality was es,sentially zero (r = ,08 with 13 
d,f,; n.s,). 



Messages for Teachers 

The implications for teachers are fourfold. Firstly, it should be recol- 
lected that this was the ver>' first trial of these tasks and yet they fell 
neatly into their assigned levels. The National Curriculum may look like 
an eclectic set of Statements of Attainment but these SoA do delineate 
stages in reading acquisition with some accuracy. Using the SoA as a 
guide, it is possible for teachers to devise appropriate reading tasks for 
each stage and to be clear about where to go from where each young 
reader is at. 

Secondly, being clear about what to do next dej^ends on an accu- 
rate picture of where the young reader has reached, Using a uniform 
test which measures only one or other of the two dimensions, maturity 
or context of performance, will not provide the information. Teachers 
who addressed only the contexts of performance at a single level of 
maturity learned only that the children were not where the teacher 
had thought they were. Years of use of tests which measure levels 
of maturity in a single context of performance have proved equally 
uninformative. 

Thirdly and most importantly, teachers who were sensitive to in- 
dividual pupil performance, challenging the child without setting tasks 
totally outwith the child s current capacity, taught classes with higher- 
average levels of reading performance. Correlation does not imply 
causation so it cannot be said that accurate assessment of current per- 
formance causes progress. Rather what is being argued here is that tests 
should be constructed in such a way that the a.ssessmenl skills required 
of teachers should be at least a subset of the professk)nal skills of good 
teachers of reading, 

r'inally. accurate assessment is not associated with any exaggerated 
differentiatk)n of the reading capacities of the children in a class nor 
with classes where all children arc kept at the same stage of reading 
acquisition. In Northern la'land at age 8 all of the first four levels of 
reading maturity are to be found in the average classroom. Some chil- 
dren just take a long time to get to level 2. 
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Conclusion 

The fundamental problem in reading-le.'^-l construction is that so little is 
known for certain about what is to be measured and that is, in part, 
because of the way that reading has been conventionalK' measured. 
From its earliest days, reading assessment has polarized the reading 
process. A series of unneeessar\' oppositions has been created between 
the reader and the text, ability and achievement, prediction and per- 
formance, a single capacit\' versus congeries of relatively independent 
competences or Armstong's (1990) specific objection to National Cur- 
riculum assessment as opposing the measurement of outcomes to the 
essential intersubjectivity of education. They manifest themselves in the 
distinction between construct-driveii and task-driven test construction 
but have a deeply pragmatic root. 

It is manifestly obvious that to measure reading reciuires a sample 
of reading performance. The choice is therefore between an explora- 
tion of the performance in context and the prediction of performance 
from surrogate activity. A full performance assessment is complex, time 
consuming and much influenced by the manner, mode and focus of 
exploration (Resnick and Resnick. 1991: Linn ci aL 1991), It is an 
interpretative act. It places the teacher at the heart of an interactive 
process, li.stening. probing and only incidentally counting as the child 
engages with the text. Performance prediction on the other hand can 
be quick (see Ballard's (1920) One Minute Reading Test), reasonably 
objective and readily standardized but unrepresentative of the com- 
plexity of the ix^ading process. Ironically tlie logistic advantages of the 
predictive approach have proved paramount in schools and school 
.systems. Hase of management of reading-te.st admini.stration has taken 
precedence over pedagogical utility in informing the management ot 
reading accjuisilion. 
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Summary 

This chapter introduces the broad principles and approaches to 
reading assessments in Australian settings. The history- of pro- 
fihng in Australia, the I'K and in the I'SA is outlined. A de- 
scription is gi\'en of the Victorian Reading Development Scales 
used to monitor the literacy progress of children o\'er years 1- 
10. Data from the trials on over S,()()0 pupils over four years are 
discussed and an outline is provided of how practising teachers 
use the profile framework to organize reading-assessment ac- 
lix ilies in the classroom. 



Introduction 

The phik)sophy at the heart of any assessment system should be the 
promotion of students' learning. This meims that as.sessment goes hand 
in hand with teaching and learning and should not be an afterthought. 
Ikx ause of such an integration, Lissessment should pro\ ide information 
on what may happen next in the learning and also information about 
current, and pre\'k)us. leiirning. With this philo.sophy in mind it is possible 
to develop central principles of as.ses.sment 

Principles of Assessment 

1. Assessments should describe the skills, attitudes and concepts that 
the student has dexeloped. These should be rehited to the curriculum 
and the instructional objectixes. That is, the assessment should be 
criterion-referenced. This leads to li description of wluit the student CLin 
tk) rather than a comparison with other students, both teacher and 
.student need li clear desc ription of the performances expected. 
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2. Assessments should enable the teacher, the parents and the 
student, where appropriate, to analyse the learning, or lack of it, taking 
place. This formative, rather than diagnostic, approach makes recom- 
mendations of further action possible. 

3. Assessments should be agreed upon by two or more teachers 
working together. This moderation is a process of bringing individual 
judgments in line with general standards. vSharing experiences and judg- 
ments of students' achievements with other teachers can help develop 
and maintain a common understanding of what could be judged as a 
standard or appropriate performance, 

4. Assessment should fit within an overall plan of the development 
and progress of the student. Thus each assessment task should reflect the 
expected progress and development of the student. Criterion-referenced 
assessment can be used to describe the development and progress of 
students in terms of tasks perfomied, skills developed, altitudes exhib- 
ited, aspirations displayed and knowledge accjuired. Various standards 
or criteria for each of these characteristics can be identified and organ- 
ized in order of increasing sophistication. When assessment tasks are 
set for students, their perfonnance can be compared with these standards. 

5. Assessments should allow for uncertainty in obsen-aiions. There 
is always room for some error in a test score and notions such as the 
reliability of a test have been defined in order to allow for uncertainties. 
Replacing tests with other forms of assessment does not necessarily 
increase the accuracy. Thus assessments need to be graded in judgment 
to allow for uncertainty, and assessors need to allow for error in their 
judgments. 

6. Assessments should de.scribe the students* progress in a fair, 
relevant and accurate manner and should have the same meaning for 
different teachers, parents and students. The assessment information 
should be both interpretable and communicatiw. 



Method 

The rest of this chapter describes the developing and refining of an 
assessment approach involving student profiles. The assessment infor- 
mation is interpreted in terms of the task, the group and the individual. 
The Victorian Literacy Profiles project, a practical example involving 
S,()(K) students in the Victorian education system, demonstrates how an 
education system or region can apply the ass:.*ssment principles out- 
lined at the beginning of this chapter. 
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Hoiv the Scales Were Developed 

When de\eloping the profiles-assessnienl system, teachers in work- 
shop sellings were asked to group areas of language development into 
the major skills of reading, writing, listening and speaking. Initially, 
they focused their attention on each of these skills, on broad educational 
reading and writing goals and on areas of learning or topics identified 
by the teachers. For each topic, steps indicating progress were identi- 
fied as the outcomes of instmction. These steps were also defined as 
the desirable attributes of students after instruction in each topic. 

This approach combined the identification of topics, the outcomes 
associated with each goal and a range of methods for gathering informa- 
tion or evidence of achie\'ement for each goal. Information-gathering 
methods were called assessment methods. These methods in turn were 
matched with the desirable student attributes for each goal or topic. 
The cN'idence of achiex ement of an outcome that was gathered by each 
assessment method was then written into a corresponding matrix cell. 
Each piece of e\'idence was called a *performance indicator . More than 
thirty of these matrices were dexeloped through teacher workshops. 
Table 8.1 shows one such matrix, illustrating the goal of de\'eloping an 
approach to new words. 

In the example in Table 8.1. teachers listed the outcomes, or .stu- 
dent attributes, and the assessment methods as follows: 

Outcomes, or student attributes: 

• seeking help from others: 

• using visual cues: 

• using auditoiy or graphophonic cues: and 

• using semantic and syntactic cues. 

Assessment methods: 

• direct obsenalion and anecdotal records: 

• listening to oral reading: and 

• conferencing with students. 

Teachers used the matrices as guides iox obsenalion in the classroom. 
They collected samples of students' work, records of obser\'ations. and 
other evidence of outcomes. These were moderated at subseciuent 
teachers* workshops. An a\*erage of six matrices were dexeloped at 
each workshop. The total set of matrices presented teachers not onh" 
with a detailed obsenation schedule, but also with a large number ol 
indicators. 
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Table 8. 1: Matrix worksheet: Approach to unknown words 



MILESTONES 



Ass6ssm6nt 




uses visual 


Uses auditory 


Uses context 


techniques 




cues 


cues 


cues 


Observation and 


asks adults/ 


moves eye 


reuses words 


rereads 


anecdotal 


peers what a 


between words 


already heard in 


sentence when 


records 


word is 


and pictures 


stories, wall 


unable to read a 








stories or oral 


word 








reading activities 






asks adults/ 


substitutes a 


uses first sound 


expresses that 




peers what a 


similarly shaped 


of a word when 


It was the 




word means 


word for the 


attempting a 


context that 






unknown word 


new word 


gave clues for 










the word 








attempts to 


queries meaning 








sound parts of 


of sentence 








words 


when unable to 










read word 


Running records 


asks adults/ 


uses appropriate 








peers for 


substitutions, 








meaning and 


e.g., house/ 








pronunciation of 


home 








a word 








Parent-teacher 


asks parent for 


states the 


states J word is 


rereads 


conference 


meaning and 


picture helped 


known because 


sentence when 




pronunciation of 


to read the text 


It sounds right 


unable to read a 




words 






word 










explains that it 










was the context 










that gave clues 










for the word 



Teachers in twenty-lwo schools ohsened and recorded the read- 
ing helniviours of 332 students using a set of matrices. To determine 
whether a behaviour was being exhibited, a rating scale was used to 
indicate the confidence level of the obser\ er. The number *()' was used 
to indicate that the beiiaviour described by the indicator was not ob- 
ser\ed. '2' was used if the obseiver was sure that the behaviour was 
exhibited, and T was used if the obsen er was not convinced that tiie 
behaviour was pail of a student's reading performance, 

ludkators of Rcadiu^u, Dcivlopmcftt 

The indicators were checked for accuracy and mapped onto an imder- 
lying development scale using item-response theor>' for rating scales 

i-'^ 13b 
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Table 8.2: A sample spread of behaviour indicators for assessing 'reading development' 



Attributes-level Indicators 



• Supports an argument or opinion on the text by reference to evidence presented m a 
variety of source:^. 

• Discusses and writes about on author's bias and technique. 

• Discusses the styles used by different authors. 

• Uses narrative and expository \exts appropriately. 

• Expresses and supports an opinion on whether an author's view is valid. 

• Uses a known text as a model for own writing. 

• Can find the main idea in a passage. 

• Follows written instructions. 

• Uses context as a basis for predicting the meaning of unfamiliar words. 

• Can retell a story with approximately the same sequence of events. 

• Can locate own name and other familiar words in a short text. 

• Turns one page at a time from the front to the back of the book. 

• Holds the book the right way up. 



(Andrich, 1978). Some interesting results were obtained. For instance, 
the indicators describing beha\'i()iir outside the teachers' direct obser- 
vation were Found to have erratic rating patterns. There were indicators 
of students" behaviour at home, their borrowing of books from pubhc 
hbraries and their reading activities outside the classroom. These indi- 
cators were excluded from the development scale. Table 8.2 presents 
a selection of the indicators from the de\'elopment scale. 

Of course, the examples in Table 8.2 are not an exhaustive or 
systematic list of all the ()bser\*able behaviours for assessing reading 
development. These indicators were selected from the calibrated list 
spread o\'er the full range of beha\'iours. A long checklist of indicators 
did not help either the monitoring or the analysis of the students' read- 
ing de\'ek)pment. It was also clear that these indicators were \*er\* closely 
placed on the continuum and thus were difficult to distinguish from 
one another. Further, because the reading behaviour depended on the 
context, content and purpose of the reading task, the relative positions 
of adjacent indicators could be unstable. 

The full list of indicators was examined for patterns that might be 
useful for summarizing them into bands or groups, a process that was 
similar to the aggregation of indicators in language-accjuisition scales 
such as the Australian Second Language Proficiency Rating Scale (ASLFR) 
(Ingram, I98f). Se\eral reading progressions, such as those illustrated 
in Fable 8.2, were identified in the list of calibrated indicators. All of 
these progressions could be empirically shown to fit into tlic overall 
development of reading behaviour. The difficulty was to put them into 
useable form. Progressions seemed to ix' related to underlying factors, 
such as: attitudinal behaviour, the influence of reading on writing. 
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role-playing, retelling l:)ehaviour. reactions to reading materials, analysis 
and interpretation, social or interactive roles in reading behaviour, word- 
approach skills, and types of reading materials chosen. 

Formation of Bands' 

Not all of the progressions could be traced all the w ay through the list 
of indicators from l:)Ottom to top. Thus l:)ands were formed by marking 
approximately equal inter\'als on the underlying scale. The indicators 
between the ec|ually spaced points on the scale formed the descriptive 
bands. The apparent underlying factors were used to ensure that each 
band contained a di\ erse and logically consistent set of indicators. A 
reading band represented a ver>' broad outline of reading behaviour 
rather than describing a discrete point of development. While se\'en 
bands were initially developed (Griffin, 1990), they did not represent 
anylhing other than apparent groupings of development indicators. They 
did not represent expected achievement levels for specific years or 
grades. The bands were labelled from A to G. Band A was set at the 
earliest developmental level in order to avoid the association of \ alue 
with devek)pment and to enable the scale to be extended later with 
other band levels as w^ork progressed with teachers at more advanced 
levels. The |-)ands were also cumulati\'e, in that a student developing 
behaviours at Ixmd E would l^e likely to ha\e established behaviours 
at lower bands and perhaps haw estal:)lished l:)eha\iours at higher bands 
(see appendix to this chapter for descriptions of reading behaviour in 
each band). 

Drafts of the reading bands were taken to schools for discussion 
with ^'achers, consultants, academics and others. They were asked to 
edit the reading bands so that they felt comfortable with their format 
and content. Modifications invoh'ed moving descriptors, editing their 
expression, adding new important indicators in the appropriate places 
or deleting those that did not add to the overall description. After the 
drafts had been revised by the various experts, further drafts of reading 
bands were prepared for field trials. Additional ad\ ice from teachers 
on recording and monitoring procedures led to another rating scale 
modelled on the levels identified in Reading On (Victorian Ministry of 
Education, 198S) and later in the Gnnvth A>f;/As* (Victorian Ministry* of 
Education, 1988). The recognized de\'elopment le\'els in that text were 
identified as beginning, developing and established. Teachers' familiar- 
ity with these terms and with the global, holistic, impression monitoring 
outlined in the Victorian reading text meant that familiar terms could be 
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Table 8.3: Expected and unexpected rating patterns for students v^ith different 



backgrounds 










Band 








Name 


A 


B 


L 


u 


t 


r 




David 


3 


3 


o 


o 

o 


Z 


z 


o 
z 


Renee 


o 


o 


o 

0 


o 

0 


o 
z 


z 


1 

1 


Fiona 


O 


o 


o 


o 


o 
z 


o 
z 




Shane 


O 


o 


o 
o 


o 


Z 


o 
z 




Scott 


3 


J 


o 
o 


o 

O 


z 


Z 


1 
1 


Maria 


3 


3 


3 


3 


z 


o 
z 


1 


Michael 


3 


3 


3 


o 


1 


z 


1 


Henry 


3 


3 


3 


2 


z 




1 


Stewart 


3 


3 


3 


3 


1 




1 


Rebecca 


3 


3 


3 


2 


1 




1 


Rosanna 


3 


3 


3 


o 
o 






1 


Cassy 


3 


3 


3 


3 


1 


1 


0 


Janelle 


3 


3 


3 


o 
o 


1 






Nicole 


2 


2 


3 


1 


1 






iNdly 


2 


1 


1 


1 


0 




1 * 


Kelly 


2 


2 


1 


1 


0 


0 


0 


Rosie 


2 


1 


1 


1 


0 


0 


0* 


Christine 


2 


1 


1 


1 


0 


0 


0- 


Pavoiia 


3 


0 


0 


0 


0 


0 


0- 


Petra 


2 


0 


0 


0 


0 


0 


0* 


Stephen 


0 


1 


1 


0 


0 


0 




Notes. • 


Non-English speaking background 










* » 


Integration student 












used for 


each band. 


A term 


was added for insufficient evidence 


of a 


particuh 


r Ix^liavioLir 


within ; 


I Ixind. 


giving a 


f()iir-i')()int scale. 





0 — no evidence 

1 — beginning 

2 — developing 

3 — established. 

In a pilot study of the bands, students were rated using the four- 
point scale. The .set of ratings for a .sample da.ss is shown in Tatile 8,3. 

Tnexpected rating patterns emerged from some class groups. >Xlieii 
the class results were analysed using a Rasch rating-scale model, some 
students were identified as developing in ways dissimilar to the re.st of 
the class (Andrich, 19''H). Tlie.se analy.ses are outlined elsewhere (Grif- 
fin and jones. U)88). In the .set of ratings in Table 8.3. tlio.se marked 
with a.steri.sks were identified as nii.sfiiting the underlying development 
scale, rpon di.scu.ssion with the cla.ss teacher, tlie.se results were attrib- 
uted to .students with non-Hnglisli-.speaking backgrounds or with spe- 
cific learning difficulties. A more detailed discussion oftlie.se is presented 
elsewhere (Griffin. 1989). 
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Field Trials 

The reading bands received extensive trials, Ihe Victoriau 100 school 
.s7//c/>' ( Rowe and Griffin, 1988), provided a formal oppoitunity to trial 
the reading bands as a monitoring and reporting instannent for gath- 
ering data on students' reading development in the following ways: 

• Teachers' ratings were used for the reading bands, 

• The Test of Reading Comprehension (TORCH) (Mossensen, L,, 
Hill, P, and Masters, G,, 1987) was administered, 

• ACER Primary Reading survey Test form AA (ACER, 1972) was 
used for year 1 students because the TORCH test does not 
apply to this level. 

The study provided the opportunity to examine the concurrent 
validity of the reading scale, the internal consistency of the teachers', 
ratings, and the relationships between the ratings and test scores at 
various le\'els of aggregation. Data on reading development was gath- 
ered on o\'er 5,000 students co\*ering years 1, 3. 5, 7 and 9, 

Messages for Teachers 

The analyses in Figure 8,1 shows that the bands are far from a stepwise 
secjuence of development. Different patterns of growth illustrate that 
de\'elopment is not linearly defined by the reading and writing bands. 
The shaded regions in the figures are di\'ided into two parts. The lower 
and upper shaded areas of each 'box' illustrate the regions with the 
highest probability that a teacher will rate the bahaviours of a student 
as beginning or 'developing' (a rating of 1 or 2 respectively) for each 
band level. Regions with the highest probability of a rating of no evid- 
ence' or 'established' (a rating of 0 or 3 respectively) for the behaviours 
of a student for each band level below and above the shaded regions 
respecti\cly. The regions indicate the 'expected' rating pattern for a 
student, given an overall estimate of the student's reading or writing 
development. 

Development is estimated numerically by a student's total rating 
score over all the bands, and the student s 'expected' rating pattern 
a.ssumes a predictable progression. A scale is shown on the left-hand 
side of each of the figures. Each shows e\enly spaced units and is 
defined by item response theor\' analysis, F.ach scale represents the 
development of reading or writing ixMiaviour, Each also represents the 
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Figure 3. 1: Reading bands and regions of most probable development 
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total score tor a sludcnl. olMained by adding the leadier s rating for 
llic student over all the reading or writing bands. Note tiiat increments 
of I in the total score do not represent even increments in reading de- 
velopment. Nevertheless, the figures show how a student's total score 
can be used to identify the .student's expectc^d' l')ehavi()ur for each of 
the Ixmds. A ruler held horizontally across the bands at a point on the 
scale that represents a student's total score will show how each band 
is likely to he represented in the student's reading or writing behaviour. 

More importantly. Figure HA shows tiie most proixilMe progression 
patteiiis in reading development. Since the boxes for the bands over- 
lap, it is unlikely that a student will fully establish any particular band 
level of behaviour before beginning to exhibit behaviours from the 
next \x\nd. An individual should tiierefore exhibit a broad range of 
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behaviours at any one time. It is expected that a student will exhibit 
behaviours covering about three bands at any one time. The frequency 
with which behaviours are exhibited will also vary. The identification 
of a specific development level defined by a single descriptive indicator 
(or a single band level or score) would, in all likelihood, be inaccurate. 
For a student with a total reading score of 11, for example, the most 
likely rating pattern assigned to the student would be 3, 3, 2, 1, 1, 1, 
0. Note that this differs from reading profiles of 3, 3, 2, 2, 1, 0, 0 for a 
second student and 3, 3, 3. 2, 0, 0, 0 for a third ,student. In the first case, 
the student is developing at band C and beginning to show indications 
of reading behaviour for bands D, E and F. If the instaictional target 
level is defined as the developmental level, then the instructional target 
range for this student would be activities associated with band C, In the 
.second case, the student is developing at bands C and D and would 
require a broader instructional programme. The third student is devel- 
oping at band D and would require a higher level of instruction. 



Conclusion 

The reading-development scales illustrate that it is possible to design an 
assessment and reporting system that is based on direct obserx-ation 
and w^hich harnesses and makes explicit the formative and intuitive 
assessment of classroom teachers. Throughout the development of these 
scales, group-moderation procedures were central to identifying the 
appropriate reading behaviours, defining these for the descriptors in 
the reading bands and choosing exemplar materials and assessment 
tasks relevant to each band in the reading progressions. In addition, 
teachers* judgments were statistically moderated by comparing the 
teachers* ratings with the students* scores on a reference test. Details of 
this study are reported elsewhere (Griffin, 1989). 

By involving teachers in an action research process it was possible 
to build an empirical base consisting of teachers* observations, which 
were used to define the growth of literacy in children, (The Writing 
Bands have not been discussed in this article although they were de- 
veloped by use of a similar process,) The resulting reporting frame- 
work makes communication possible between teachers and parents, 
between teachers and the school system and between teachers and 
teachers. The bands will enable descriptive reports to be written for 
employers. In fact, the bands prcnide a common framework so that 
.student growth and development can be monitored and the information 
gained can be communicated in a variety of ways without changing 
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standard interpretation of assessment data and without taking away a 
school's right to develop a curriculum to fit local needs. 
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Appendix: Literacy Profiles — Reading Bands 

Band G: Reads materials of \'ar\ing complexity, e.g., manuals, arti- 
cles, Shakespeare etc. Interprets simple maps, tables and graphs in the 
context of ex'idence presented. Makes generalizations, summaries, con- 
clusions from reading materials. Reads w idely for interest, to learn or 
for plea.sure. Reads at different speeds, using scanning, skim reading or 
careful reading as appropriate. Makes informed choices about reading 
materials. Can use a variety of resources to locate reading. vSupports 
argument or opinion on text by reference to evidence presented in a 
variety of sources. Compares information from different sources. Hx- 
Iracts and connects embedded ideas from complex sentences and para- 
graphs. Can analy.se the cohesiveness of text as a whole. Discussion 
shows an ability to identify opposing points of \'iew and main and 
supporting arguments in text. Discusses and writes about authors; bias 
and technique. Displays critical opinion and anahsis in written reports 
of reading materials. 

Band F: Discusses author* s intent for the reader. Makes links between 
arguments and ideas in text with personal experience. Discusses styles 
used by different authors. Forms generalizalions about a range of genre 
including myth, short stor>* etc. Offers reasons for the mood of the text. 
Writing and discussions show x'aried interpretations of text. Offers crit- 
ical opinion or analysis of reading passages in discussion. Verbally 
justifies appraisal of a text. Can select passages or phrases to answer 
(juesiions without careful reading of whole text. Formulates research 
topics and cjuestions and finds information recjuired from reading ma- 
terials. Uses narrative and expositorv' texts appropriately. Reads aloud 
with fluency and expression. Hxpands on and synthesizes information 
from a rani»e of texts in written work. Maps out plots, procedures and 
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character developments in novels and other texts. Varies reading strat- 
egies according to purposes for reading and nature of text. Makes 
connections between texts, recognizing linking themes and values. 

Band E: Improvises on a range of texts in role-play. Writing shows 
meaning inferred in the text. Explains a piece of literature and de- 
scribes settings. Expresses and supports a view on whether an author's 
point of view is valid. Discusses implied motives of characters in the 
text. Makes comments and expresses feelings about characters. Re- 
writes text in own words. Uses known text as model for own writing. 
Can record authors' implied messages. Uses a range of books and print 
materials as information.sources for written work. Reads to others with 
few inappropriate pauses. Uses suffixes, prefixes, and meaning of word 
parts to interpret new words. Can use directories such as a table of 
contents or index telephone and street directories to locate information 
in multiple sources. 

Band D: Discusses different types of reading materials. Reads mate- 
rials with a wide variety of styles and topics. Selects books to read for 
a purpose. Discusses materials read at home. Can find main idea in a 
passage. Tells a variety of audiences about a book. Recommends books 
to otheis. Uses vocabu!ar\' and sentence structure from reading mate- 
rials in written work as well as in conversations. Themes from reading 
appear in art work. Substitutes words with similar meaning when read- 
ing aloud. Begins to self-correct using knowledge of language staicture 
and sound-symbol relationships. Uses knowledge of language struc- 
ture and/or sound symbol to make sense of a word or phrase. Follows 
written instructions. Reads often. Reads silently for extended periods. 

Band C: Rereads a paragraph or sentence to establish meaning. Uses 
context as the basis of predicting meaning of unfamiliar words. Reads 
aloud showing understanding of purpose of punctuation marks. Uses 
picture cues to make appropriate substitutions. Uses pictures to help 
read a text. Concentrates on reading for lengthy periods. Writing and 
art work reflect understanding of text. Reads, retells, discusses and 
expresses opinions on literature. Seeks recommendations for books to 
read. Chooses more than one type of book. Can find where another 
reader is up to in a reading passage. Recalls events and characters 
directly from text. Chooses to read when given free choice. 

Band B: Takes risks when reading. Selects own books to read. \'er- 
bally describes connection among events in lexis. Writes, role-pla\s 
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and/or draws in response to a stor>' or otlier form of writing etc. (e.g., 
poem, message). Creates ending when tlie text is left open. Asks others 
for help with meaning and pronunciation of words. Recounts in writ- 
ing, drama or art work, or retells using language expressions from 
reading sources. Consistently reads familiar words and symbols within 
a text. Predicts words. Matches known clusters of letters to clusters in 
unknown words. Uses words in environment when reading and xs rit- 
ing. Shows knowledge of sound-symbol relationships when writing. 
Reads books with simple repetitive language patterns. Uses pictures for 
clues to meanings of text. Can recognize words within other words. 
Uses features and format of books in reading. Retells with approximate 
secjuence. 

Band A: Holds book the right way up. Turns pages from the front to 
the back. Can point to the beginning and end of sentences, and distin- 
guish between upper and lower-case letters. Can turn to the start and 
end of books. Responds to literature (smiles, claps, listens intently). 
Joins in familiar stories. Locates v\'ords, lines, spaces, letters. Refers to 
letters by name. Can locate own name and other familiar words in a 
short text. Identifies known familiar words in other contexts. Chooses 
books as free-time activity. Shows preference for particular books. 
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Summary 

This chapter presents a study investigating the reUability and 
validity of KSl SAT reading-assessment scores and the individu- 
ally administered word-reading tests of the British Abihty vScales 
and the Differential Ability Scales. The former had been stand- 
ardized on a nationally representative sample of uritish pupils 
in 19"76; the latter on a representative sample of American pupils 
in 19B9. The results cast doubts on the reliability, xalidity and 
utility of the NC SATs in assessing standards and progress in 
reading. 



Introduction 

Reading matters. It is an amplifier of human potential. The ability to 
read opens up to the individual the thoughts and feelings of other 
minds, past and present, here and in other countries, through the me- 
dium of text. Not to be able to read in our society is to be disadvan- 
taged and marginalized both culturally and economically. Parents have 
no doubts concerning the importance of reading. Teacliers and schools 
are expected by society to ensure that children become literate in the 
fullest sense of the word. How can we determine whether standards in 
reading are stal'>le or changing for better or worse? 



Context 



In 1989. an UMI report on Standards in HducatUnt made the following 
points concerning standards in primary* schools. 
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In inainiained primary schools, standards in basic language work 
are generally sound in that niost pupils are laughi the early 
stages of reading and writing systematically. Indeed, there are 
discernible improvements in both the c/nality and range of 
Wfitten work [my emphasis] due partly to the influence of the 
National Writing Project . . . Unfortunately there is still far too 
little stimulation and challenge for pupils to read more widely 
once fluency has been established. (DES, 1989, par. 9) 

Despite this reservation, at that time reading standards during the early 
stages of education did not appear lo be a cause of serious concern. 
Indeed, in view of the relationships between all aspects of literacy, one 
would have difficulty in understanding how the general standard of 
pupils* writing could increase if their general standard of reading was 
not doing so in tandem. Despite this, it is also possible that increasing 
means on reading test scores can be associated with an increasing 
number and proportion of pupils with low reading attainments. 

To a go\'ernment that had held power for the previous eleven 
years and is, in considerable measure, accountable to the electorate for 
standards in the state educational system. 1990 marked the beginning 
of a period of growing disquiet concerning the implementation of the 
National Curriculum in general and concerning the teaching and assess- 
ment of reading in particular. That disquiet continues in 1995. 

In 1990. on the basis of standardized reading te.sts results collected 
on successive cohorts by .some LEAs over many years, the possibility 
was rai.sed that reading .standards might be falling. Senior educational 
p.sychologists from nine edui\ition authorities released to the Times 
Hducational Snppletnent anonymous confidential data based on the 
test .scores of nearly 400.000 7-year-old pupils. It was as.serted that, over 
a period of years, there had been a .statistically and educationally sig- 
nificant decline in the mean reading-te.st .scores of successix e cohorts. 

The psychologists met to di.scu.ss their concern on 22 June 1990. Of 
nine authorities repre.sented, eight seemed to show a decline. The ninth 
is a small authority which re-standardi.ses its test each year, hut believes, 
nevertheless, that there is nc^ decline ... It remains the case, though, 
that conclusive evidence of a single authority without .such a decline 
has yd to appear (Turner. IWOa). 

Cx)mmenting that The i.s.sue is too important for the facts to be 
locked away in a drawer, John MacGregor. the then Secretary of State 
for Education, demanded to .see the data. In fact, the evidence from 
Croydon LEA was already in the public domain. On (> July their chief 
inspector is reported as confirming that the mean reading-test .scores of 
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T-year-old pupils in the Borough had dechned since 1985. A week later, 
four out of eleven educational authorities contacted by the Times Edu- 
cational Supplement weri:^ reported as having reading-test results which 
suggested a drop in reading standards among 7 year-old-pupils (Castle, 
1990). 

In September 1990, the Independent Primary and Secondary 
Education Trust, formerly the Education Unit at the Institute for Eco- 
nomic Affairs, published a booklet describing the findings (Turner, 
1990b). The report claims that there is clear evidence that hundreds of 
thousands of children in nine LEAs south of a line from the Mersey to 
the Wash are subject to a sharply downw-ard trend in reading attainments 
at the ages of 7 or 8. It is also claimed that there had been a 50 per cent 
increase in the number of pupils who can be described as 'very poor 
readers'. This unwelcome claim resulted in a severe decline in the 
standard of public discussion of this claim and of the evidence bearing 
on it. 

To any government, an assertion headlined 'Reading standards fall!' 
is a potential political disaster. It arouses public interest and concern 
equivalent to The Titanic has sunk!'. The speed with which the then 
Secretary of State for Education despatched HMI to investigate, and 
simultaneously involved the NFER in a complementary' survey of read- 
ing attainments in the autumn of 1990, underlined the extreme political 
sensitivity of the issue. HMI produced a report by the end of the year 
(December. 1990) based on 3.000 inspection visits, involving 120 pri- 
mary schools where HMI observed the teaching and learning of reading 
in 470 classes and listened to over 2,000 pupils read aloud. The pupils 
were in National Curriculum Years R, 1, 2, 3 and 6. Pupils' reading 
fluency, accuracy and understanding were judged using HMI techniques 
(Rose, 1992). This inspection provided part of the evidence on which 
HMI's judgments were made concerning reading standards and the 
effectiveness with which reading was being taught in their published 
report (DES, 1991d). The widespread availability of standardized reading- 
test results in primary' schools was not apparently used to moderate 
HMI's conclusions concerning reading standards. 

The evidence from such testing cannot contribute to a relial'>le 
national picture of reading standards: or the direction in which 
ihey are moving or if they are moving at all. (ibid., par. 7) 

I-A'en when classroom obsenations and Iiearing children read are 
carried out by highly experienced professionals such as HMI, are we 
not, once again, being invited to give too much weight to empirically 
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unsupported, somewhat subjective assessments concerning reading 
standards? 

At the same time as the HMI carried out its surve\', the School 
Examinations and Assessment Council (SEAC) was asked to carry' out a 
sur\'ey of the arrangements made by LEAs for monitoring reading 
standards. The NFER undertook this work and concentrated on NC 
Year 2 assessment practices. Their report entitled An Enquiry into LEA 
Evidence on Standards of Reading of Seven Year Old Children makes 
interesting reading (Catto and Whetton, 1991). All 116 LEAs in England 
and Wales were approached. Thirty-six had not carried out LEA-wide 
testing of reading since 1981. Two LEAs refused to take part in the 
study. Ninety-five LFiAs provided information. Of these, since 1981, 
fifty-nine had carried out authority-wide testing of reading in .some 
years during the period up to 1990. The instalments used are considered 
critically and a number of weaknes.ses identified. A central, albeit highly 
controN'ersial, as.sertion made was that The te.sts reflect outdated and 
narrow conceptions of reading, and certainly do not match the pro- 
ces.ses of reading defined in English Attainment Target 2 of the National 
Curriculum* (ibid., p. 22). 

The .standardizations of the te.sts were criticized, but for the purpo.ses 
of large-scale monitoring of reading attainments, it was considered that 
technical deficiencic's were not so great as to in\'alidate their use for this 
purpose. 

Some of the tests are now \'eiy old and the content of these 
often includes words which would be difficult for modern chil- 
dren. This alone could account for a decline in .scores. For the.se 
rea.sons it would not be adx'i.sable to u.se any of the te.sts as 
measures of reading as defined by the National Curriculum. They 
are not appropriate or adequate for this purpo.se (ibid., p. 22). 

The report concludes: 

The attempt to discern a national picture of pupils* performance 
from di.sparate methods of a.s.se.ssment highlights the need to 
have national as.scs.sment procedures with agreement as to 
the forms in which test results will be analysed and reported, 
together with a "-^tatutor}' ol)iigation to make data publicly avaii- 
abl.. (ibid., p. 1()8) 

F.arly in IWl, the House of Commons Select Committee on edu- 
cation, science and arts undertook an examination of the evidence and 
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issues raised by the reading vStandards controversy. In iheir subsequent 
report, it was stated: AVe conclude that the claim that reading standards 
have fallen in recent years has not been proved beyond reasonable 
doubts' (pan 12). 

The Governiiieni responded to the Select Coniiiiittee's report 
noting the above point. The Go\'ernnient further commented: 'The 
national interest requires that standards of reading should rise in pace 
with society's ever more demanding needs for literacy; it is not enough 
that standards should not be falling. All schools should strive to match 
the standards of the best, and LEAs should ensure that reading stand- 
ards in their area are kept under control. 

How might this be done? It was intended that the National Curric- 
ulum asses.sment arrangements based on 'clear national targets for 
pupils* reading attainments across the full age range for compulsory' 
education' would be tfie way fonvard. This leads to the central role of 
Standard Assessment Tasks (SATs). It also raises the question as to 
whether reading SATs can provide in one measure the variety of infor- 
mation teachers and parents reciuire in order to monitor and enliance 
children's reading attainments and progress in reading. 

The basis of the SAT approach to the assessment of reading stand- 
ards and progress is clearly somewhat different from that underpinning 
the use of standardized word-reading tests such as the British Ability 
Scales (HAS) and the Differential Ability Scales (DAS). One of the major 
emphases of the reading-assessment procedures developed for National 
(Airriculum reading SATs has been the importance of ensuring that the 
assessment materials are as naturalistic* as possible, in book form and 
with a story content that is culturally familiar. Such requirements 
are based on the assumption that children's reading is predominantly 
context-dependent, and that, if faced with anything unfamiliar in either 
form or content, children will underperform. 

In part, as a consecjuence of this view concerning the development 
of leading, normativ e word-reading tests have been severely criticized 
during the last decade. Their form and content were said by some 
critics to be of little utility to teachers and parents in relation to the 
assessment of standards and progress in reading of individual pupils 
and groups. Many of the criticisms have been shown to be invalid. 

An extensive review of research into the early stages of learning to 
read, supported by the I S Office of Educational Research and the 
Reading Research and Education Centre at the Centre for tl:e Study of 
Reading based at the University of Illinois, Trbana-C^hampaign, under- 
lines this point. In skilled reading, gcncratizahitity and antontaiicity of 
reading skills arc of the essence. \ . . the ability to read words, cjuickly. 
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accurately, and effortlessly, is critical to skillful reading comprehension 
— in the obvious ways and in a number of more subtle ones . . . unless 
the processes involved in individual v^ord recognition operate prop- 
erly, nothing in the system can either (Adams, 1990, p. 3). Context cues 
alone are insufficient to optimize reading development and improve 
reading attainments and progress. 

Testing a pupil's ability to read decontextualized words, as in word- 
recognition tests such as the BAS and the DAS, is not necessarily a 
pedagogic sin; it is much more likely to represent an educational virtue. 
A test of decontextualized word reading does not have the same face 
validity' (often referred to as 'ecological validity ) as reading SATs in 
that the former does not reflect real-world reading tasks. Despite this, 
word-reading tests can efficiently provide valid empirical information 
about standards and progress of individuals and groups in the decod- 
ing skills that underpin reading comprehension. (The pedagogy whereby 
automaticity in early reading can be best achieved is a separate issue 
from that concerning the respective uses and limitations of the reading 
SATs and decontextualized word-reading tests). 

Aspirations 

Concern about pupils' educational standards in general and reading 
attainments in particular, are matters of perennial interest. In England 
and Wales, the ii.aplementation of the Education Reform Act 1988 es- 
tablished a National Curriculum. Its prime purpose was to raise educa- 
tional standards. Each individual pupil's attainments and progress were 
to be regularly monitored at the end of the four Key Stages into which 
tlie curriculum was divided. Key Stage 1 covers the chronological age 
range from 5:00 to 7:06 \'ears. These pupils were the first to whom 
National Curriculum English was taught, reading standards assessed 
and subsequently reported in national league tables covering all local 
education authorities (DES, 1991c; DfE, 1992). 

Englisli is one of three core' subjects in the National Curriculum. 
Within English, reading' is one of four components. In theor>\ the 
Standard Assessment Tasks provided common assessment procedures 
whereby valid evidence concerning standards and progress in reading 
can be systematically, comprehensively and annually collected for every 
pupil attending state schools at the end of each Key Stage. The form, 
content, administration, scoring and reporting of National Curriculum 
English Profile Cx)mp()nent 2 reading' of KSl SATs have been clearly 
presented, iuuli year they have been modified in respoase to criticisms 
(SEAC, 1991, 1992, 1993). 
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The Parents' Cfe^^/rer makes explicit parents* rights and respons- 
ibilities in relation to the education of their children. 

The Government's plans are designed to ensure that you have 
all the information you need to keep track of your child's 
progress, keep in touch with his or her school and compare all 
local schools. (DES, 1991a) 

The point of the new tests is to give you and the teachers an 
exact [sic] picture of what your child has learned. Knowing how 
your child measures up against national sl^ndards will give you 
the best idea of his or her real progress. (DES, 1991b, p. 2) 

On the one hand, these intentions and claims reassure us that the 
Government, the then Department of Education and Science and the 
current Department for Education are concerned with accuracy when 
measuring children's real progress against national standards. On the 
other hand, it is possible that the confident tenor of the above claim is 
inversely proportional to its validity, given the SAT-based assessment 
procedure that was to be used. 

Put simply, on the evidence currently available, the claims for the 
quality of the information provided i:»y the reading SATs probably con- 
travenes the Trades Descriptions Act. With reference to the procedures 
to be adopted for the assessment of reading standards, for the Govern- 
ment, through its agents the School Examinations and Assessment 
Council and its successor body the School Curriculum and Assessment 
Authority, to make a mistake is forgivable. To make a mistake because 
of inadequate consultation, is less forgivable. To refuse to reconsider a 
mistake made because of lack of consultation, would be unforgivable. 

In order to allay the many hostilities that had been aroused by the 
massive workloads placed on teachers as a consequence of change 
upon change to the National Curriculum, Sir Ron Dearing was brought 
in hy the Government. In a speech given in May 199,-^, to teachers in 
the West Midlands, he is reported as saying 'When you hear a class 
teacher with thirty-five KSl children just coming up to the age of se\ en 
saying you have to complete 10,391 standard assessments, you realize 
what is inx'olved*. SATs at KS 1 are complex and time-demanding. They 
have been severely criticized |-)y teachers and research workers (NUT, 
1991; Pumfrey and Elliott, 1991; Smithson, IWl; Bennett et ai. 1992; 
Pumfrey, 1992; Sainsbury et ai. 1992; NUT and the School of Ixiuca- 
tioii, University of Leeds, 1993). 

This study, carried out in collaboration with Drs C.I). Elliott and 
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S. Tyler, investigated the reliability and validity of the Key Stage 1 SAT 
reading scores in relation to pupils' scores on two individually ad- 
ministered word-reading tests. These were the British Ability Scales and 
the Differential Ability Scales {V\\\o\\, 1983; 1990). The former had been 
standardized on a nationally representative sample of British pupils in 
1976; the latter had been standardized on a representative sample of 
American pupils in 1989. 

We were interested in two related questions. The first was whether 
there had been any changes in the word-reading attainments of a rep- 
resentative sample of English children over fifteen years. The second 
was to compare children's scores on two word-reading tests and on the 
NC En PC2 Reading SATS at KS 1. 

Evidence of a decline in mean word-reading test attainments in the 
sample at this age level was not identified on the BAS at Key Stage 1. 
IlowxA'er. a comparison with the more recently standardized American 
DAS results suggests that complacency is not appropriate. The results 
also cast doubts on the reliability, validity and utility of the National 
Curriculum SATs in assessing standards and progress in reading. 

Reference is made to two ongoing studies at the University ot 
Manchester School of Education following up and extending the above 
research into reading standards. Standard Assessnient Tasks and the 
National Curriculum. 



Method 

A socio-economically stratified nationally representative sample of "i'ear 
2 pupils was drawn from the schools within a large local education 
authority. All members of the sample had completed their SAT reading 
assessment in June 1991 (N = 209). The SAT assessment procedure and 
discussion culminated in a crude four-point ranking of groups: Work- 
ing towards level 1 (W), levels 1, 2 and 3 and above. 

Results 

III the following November, each pupil completed the word- 
reading tests of the l^AS and the DAS. Order of presentation and sex of 
the subjects were counterbalanced in the administrations. Hiirteen cjuali- 
fted, experienced teachers and an educational ps\'chologist, all having 
been trained in the administration of the tests, collected the data from 
twelve schools. 
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Table 9. 1: Sample characteristics 



Sample 



N = 209: 107 boys. 102 girls 
Year 3; age range 87-99 months 

• 71 children in 4 schools with advantaged catchment area 

• 60 children in 4 schools with mixed catchment area 

• 78 children in 4 schools with disadvantaged catchment area 



Table 9.1 summarizes the composition of the nationally representa- 
tive sample of pupils. Pupils' scores on the National Curriculum English 
tests were compared with the the national figures for the four grades. 
The national figures for 1991 presented by the Department of Educa- 
tion and Science showed 71 per cent of pupils at level 2 or above and 
21 per cent at level 3. For the whole of the LEA from which the current 
sample was identified, the overall percentages w^ere 77 per cent at level 
2 or above and 26 per cent at level 3, respectively, i.e., somewhat 
higher' SAT reading scores than the national averages. 

In contrast, for the nationally representative selected sample within 
the LEA, there were 70 per cent pupils at level 2 or above and 19 per 
cent at level 3. This approximates the national figures. It is necessary' to 
keep in mind that the errors of measurement associated with the SAT 
scores are not as yet known. The assumption is that they are likely to 
be considerable. 

The importance of word-reading abilities and te,st scores in relation 
to the development of literacy, has been referred to in the 'Context* 
section above. The sample s scores on the BAS and DAS were compared 
using analysis of variance. This demonstrated that the main effects of 
order of presentation and sex were not statistically significant, although 
the girls* mean scores were slightly higher. The first order interaction 
combining the effects of order and sex was also non-significant. 

By comparing the mean scores of the sample on the BAS and the 
DAS with the normative data obtained in 1976 and 1989 respectively 
(see Table 9.2), two interesting comparisons can be made. First, it is 
possible to see whether standards on the I^AS have changed over a 
period of fifteen years. It is also possible to compare the performance 
of this sample of English pupils with the scores of American pupils on 
the more recently- standardized DAS word-reading test. 

Three points deserve consideration. Hie first is that the mean scores 
of pupils on the BAS has not altered significantly from that obtained by 
the standardization sample in 1976. However, a second point is of con- 
siderable relevance to the is.sue of reading standards in England. There 
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Table 9.2: Total sample results on BAS and DAS 




Total Sample 


• BAS Word-reading mean = 100.2 




s.d. = 18.0 




• DAS Word-reading mean = 98.5 




s.d. = 18.2 




t = 4.45 df = 208 p < .001 




Table 9.3: Total sample: SAT, BAS and DAS intercorrelaVons 


Total Sample 


Correlations 




BAS 


DAS 


SAT .67 


.66 


BAS 


.95 



appears to he an increase in the standard deviation of the scores on the 
BAS. assuming our sample to he nationally representative. It is likely 
that, even though the mean scores have remained stahle. there has 
heen a increased spread of scores ox er time. Thus the presence of more 
high-scoring pupils would counterhalance the increase in lower-scoring 
pupils. In summary, the numher and proportion of pupils with low 
word-reading attainment test scores may have increased despite the 
apparently stahle mean score. The third point concerns the comparison 
of the sample\s scores on the DAS and the American standardization 
sample results, A small hut statistically significant mean-score difference 
was identified. The standard deviation of DAS scores for the Fnglish 
sample was similar to that ohtained on the BAS. There can he no 
groimds for complacency over standards of word-reading skills at KS 1. 

The reliabilities and validities of the BAS and the DAS are well- 
estahlished. The correlations hetween pupils* SAT reading scores. BAS 
word-reading lest and DAS word-reading test scores are presented as 
Table 9.3. These figures support the high reliabilities and concurrent 
validity of the BAS and the DAS. The reading SAT score includes read- 
ing accuracy scores based on particular words correctly read by the 
pupil in the context of a short stor\'. The semantic and syntactic com- 
plexities of the stories differ markedly (Morris. 1992). These and other 
im([uantified sources of error variance exist between the stories from 
which the teacher is allowed to select. There are also other components 
of reading that contribute towards the reading SAT result. It is highly 
likely that, partly as a consecjuence of uncontrolled and unc[uanlified 
sources of error variance, the SAT reading assessment is likeK' to be of 
much lower reliability llial that of the l^AS and DAS. In addition, the 
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Table 9.4: BAS scores and socio-economic status of school intake 



BAS Standard Scores 





Mean 


S.d. 


Range 


RA 
Range 


Advantaged schools 


108.1 


12.6 


78-135 


6:0-14:00 


Mixed schools 


103.5 


19.3 


69-135 


5:5-14:05 


Disadvantaged schools 


90.4 


16.8 


66-135 


5:0-11:10 



restricted four-point range of the reading vSAT tend.s to reduce the 
covariance with otiier measures having a much wider degree of 
inierindividual v ariance. In view of such criticisms, can the reading SAT 
results provide the quality of infonnation that the government has sug- 
ge.sted is expected? 

Varied aspects of reading are tapped .somewhat unsysteniatically 
and, in parts, .subjectively using reading SATs at KSl. It is argued that 
lhe.se reading .skills require the pupil to have internalized and auto- 
mated the processing of a number of words that permeate pupils' tex- 
tual materials and are encountered and read with great frecjuency. 
Limitation in the child's working memoiy re(iuires automatization in the 
proces.sing of the surface features of text as a neces.sar\' but not suffi- 
cient condition for the meanings within the text to he efficiently and 
rapidly con.structed. Pupils who are in the early .stages of learning to 
read may well have to in\'est considerable conscious effort in using the 
grapho-phonic and/or syntactic a.spects of text, in contra.st to compe- 
tent readers who do not. The capacity of working memor\' is limited. 
Reading words out of context is a valid index of generalizability of an 
important skill underpinning efficient reading. 

Turning to the BAS .scores of the sample, their relation.ship to the 
.socio-economic status of the .school intake is shown in Table 9. t. The 
above pattern of BAS .scores has been replicated in many other siudies 
using other reading tests in other cultures and at other times (Pumfrex', 
1990: Lake, 1992: Lewis, 1992). On average, the higher the .socio- 
economic .status of the catchment area of the .school, the higher the 
pupils* mean BAS .scores. Why this should be tlie case and what the 
findings indicate in relation to raising reading .standards, will continue 
to be highly controversial. What is clear from the data is the wry con- 
siderable range of inierindiv idual differences on HAS .scores inthin each 
.school. Whate\er it is thai enables .some pupils to learn more rapidly 
than others, it is to be found in pupils in r/// .schools irrespective of the 
material acK aniages or otherwise of their intakes: but it is riot cv///r///r 
present in all .schools. 
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Table 9 5: Reading SAT levels across three categories of school 



SAT Levels (per cent) 



Level 


Advantaged schools 


Mixed schools 


Disadvantaged schools 


Approaching 








level 1 


0 


2 


4 


Level 1 


22 


25 


36 


Level 2 


55 


40 


56 


Level 3 


23 


34 


4 



Table 9.6: Range of BAS word-reading scores at each SAT level 



BAS Word-reading Scores 



RA 

Mean s.d Range Range 



Approaching 

level 1 67.8 2.9 66-72 5.0-602 

Level 1 862 13 7 66-120 5:0-9:06 

Level 2 101 5 14.0 69-135 5.7-12:09 

Level 3 120 1 102 99-135 7 7-14:05 



A similar pattern (see Table 9.5) is reflected in the percentage of 
pupils at each of the four reading SAT le\'els across the three categories 
of school. 

The mo.st challenging results were those in which the means, 
standard de\'iations and reading-age ranges of BAS word-reading test 
.scores were compared across each of the four le\ els of reading attain- 
ment identified by the SAT reading-as.sessment procedure. These results 
are set out as Table 9.6. A SAT le\'el 1 reading assessment can be at- 
tained b\' pupils ha\-ing a BAS reading age range of from between 5:00 
to 9:()(> \'ears. A SAT le\'el 2 reading assessment can be obtained with a 
BAS word-reading test reading-age range of between '^Xr to 12:09 years. 

Subsequent work on reading comprehen.sion test .scores and SAT 
NClHn .2 Reading lewis indicates a similar cn erlap (Da\'ies and Brember, 
in press). 

It is contended that SATs do not provide the (juality of sumnuitiiv 
infonnation from which standards and progress in the reading attain- 
ments of individuals and groups can be validly as.ses.sed. They cannot 
inlorni us whether or not the National CXirriculum Hnglish programme 
is affeiling reading attainments. For summati\'e purposes, the e\'idence 
presented here suggests strongly that well-constructed and standard- 
ized norniatiw reading tests have much more to commend them than 
SAT reading assessments. 
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Table 9.7: Range of reading-comprehension test scores at each SAT level for 1991 





Sample 


Level 1 


Level 2 


Level 3 


n= 

Reading score 
Reading age 
Comprehension 


176 

5-39 
-6-10.75 
60-131 


35 

5-27 
-6-7.05 
60-100 


106 
17-35 
-6-9.25 
81-125 


35 
24-39 
6.75-10.75 
91-131 


Notes: ~6 indicates a reading age below the cut off point of 6. 
N = 176 (5 randomly selected schools) 




Table 9.8: Range of reading-comprehension test scores at each SAT level for 1992 




Sample 


Level 1 


Level 2 


Level 3 


n= 

Reading score 
Reading age 
Comprehension 


171 

1-37 
-6-10.00 
40-131 


27 

1-32 
-6-8.05 
40-105 


88 
10-32 
-6^.05 
69-1 1 5 


56 
18-37 
-6-10.0 
85-131 



Notes. -6 indicates a reading age below the cut off point of 6. 
N = 171 (5 randomly selected schools) 



The early developnieni of pupils' reading abilities is, in large 
measure, dependent on a inaster>' of the alphabetic principle. It repre- 
sents a foundation without which other aspects of reading are unlikel>' 
to develop. Autoiiiaticily is important, overlearning is essential and 
mastery matters. Technically sound instruments, such as the BAS and 
the DAS word-reading te,sts, based on the accurate decoding of de- 
contextualized words, represent a more x alid foundation for summalive 
assessment of standards and progress than do SAT reading assessments. 
The latter are incapable of delivering the information their proponents 
promise. 

The study involving BAS, DAS and reading SATs was replicated in 
1992. albeit under more difficult circumstances. The results are being 
analysed. Initial indications are that they show considerable stability in 
relation to the word-reading test results from the previous year (Pumfrev, 
1993). 

Other work at the l^niversity of Manchester compares reading SAT 
levels with scores from a standardized test of reading comprehension, 
the NFHR Primary Reading Test. The findings cast further doubt on the 
(juality of the information provided by the reading SATs at KS 1, insofar 
as summative assessment is concerned. Preliminary results from the 
1991 and 1992 cohorts (see Table 9.7 and Table 9.8) based on 176 and 
171 pupils attending five randomly selected schools within an LHA make 
the point (Davies. 1993). 

In presenting these preliminaiy analyses, the limitations in the 
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sampling, data and analyses are acknowledged. Despite these weak- 
nesses, the argument that SATs are of utility in summative assessments 
at Key Stage 1, enabling standards and progress in reading to be gauged 
for the individual, the school, the LEA and the nation, is open to serious 
question. 

Messages for Teachers 

The assessment of children's reading standards, attitudes towards read- 
ing and their progress in the early stages of learning to read can pro- 
vide valid indices of the effectiveness of an educational system. 
Monitoring both cognitive and affective aspects of reading develop- 
ment requires a simultaneous consideration of the objectives of the 
reading programme, the curricula, resources and methods whereby these 
objectives will be achieved and the assessment of pupils' progress to- 
wards the stated objectives. 

'Whatever exists, exists in some quantity and can, in principle, be 
measured.' Thus spoke Thorndike, one of the pioneers of applied psy- 
chology and expert in educational measurement. More importantly, the 
measurement of children's inter and intra-individual differences in 
reading products and underlying processes can be undertaken to the 
advantage of all concerned. 

In the nineteenth centur>'. Lord Kelvin had earlier stated an im- 
portant principle. It is pertinent to virtually any field, including that of 
reading. When you can measure what you are speaking about and 
express it in numbers, you know something about it; but when you 
cannot measure it. when you cannot express it in numbers, your 
knowledge is of a meagre and unsatisfactory kind.' Sadly, measurement 
and numbers can also be used as a smoke-screen to hide our lack of 
imderstanding of the reading process and its development. 

In December 1992. Professor Stuart Sutherland, Her Majesty's chief 
inspector of Schools in England and Wales and head of the Office for 
Standards in Education (OESTED) commented on the management of 
education as follows. 1f you can t measure it. you can t manage it.' 

The informed use of reading tests and assessment technicjues can 
contribute towards improving the reading attainments and progress 
of pupils. Despite the limitations of testing and assessment, they are 
essential if the educational system is to demonstrate its accountability. 
I'nfonunately. the value of reading tests is fre(|uently under or overvalued 
by users. This can Ik* a consecjuence of a restricted appreciation of their 
conceptual bases, available sources, technical strengths and weaknesses 
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and their legitinvalc uses. It can also derhe from limitations in our 
conceptualization of the reading process and its dex'elopinent. 

Before using r/;/rforni of reading-assessment procedure, ask, and 
answer, the follow ing four questions. Dependent on how adequate you 
and your colleagues consider your answers to be, decisions can be 
made whether to: 

• use the test or technic[ue: 

• modify it. justifying any modification made; 

• reject its use. making clear your reasons for so doing; or 

• defer a decision. 

The final option enables one to exj^lore the question and the basis 
of one's answer in greater depth. 

1. For what individual and/ or institutional decision-making pur- 
poses will the information that I collect be used? 

In relation to reading-assessment results, one major distinction is whether 
the information obtained is to be used for institutional decision-making 
purposes or for indi\*idual decision-making purposes. The former typic- 
ally concerns the assessment of groups of pupils. The data can be 
anonymous. The focus is on interindividual differences in readitig-test 
scores. The.se can be used to produce league tables' comparing trends 
in national reading standards and also standards over time in a school 
or a class, the comparison of groups of pupils such as boys and girls, 
and in selection procedures. In contrast, individual decision-making is 
oriented towards the identified individual student's particular pattern of 
intra-indi\*idual differences, luul to the instructional implications of the 
patterns identified. 

1. Does the assessment techni([ue provide valid indices of read- 
ing stimdards and progress? 

In Hngland imd Wales. National Clurriculum Fnglish includes expressive 
skills, such as oracy and writing (including spelling), and receptive 
skills, such as listening and reading. Their developmental interrelation- 
ships are complex. 

There are many methods of assessing children's reading attain- 
ments, attitudes towards reading and progress in reading. It is import- 
ant that teachers understand the rekitive strengths and weakness of the 
following assessment procedures in relation to the stated objectives of 
the reading programme; 
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• obseivaiional procedures; 

• infonnal reading in\'entories: 

• normative assessments: and 

• criterion-referenced assessments. 

In 1991. Vyc Parefit's Chcoier siait^d The Government's plans are 
designed to ensure thai you ha\'e all the information you need to keep 
track of your child's progress, keep in touch with his or her school, and 
compare all local schools.* Standard Assessment Tasks and teacher 
assessments have been de\'eloped by the Schools Examination and 
Assessment Council on an inadequate theoretical and empirical basis. 
In contrast, there exists a well-developed iheon* of normati\'e reading- 
test construction. 

The comparison of the results of a NCEn2 (reading) Standard As- 
sessment Task and that of the BAS and DAS word-reading lest results 
highlights the important differences between an apparently simple test 
of word reading constructed according to an explicit theon* of test 
construction and that of a SAT constructed on much more subjecti\e 
and less explicit criteria. The undedying assumptions concerning the 
nature of the phenomenon l')eing tested should be made explicit by 
advocates of any reading test or assessment technique. 

The ten-point scale of reading attainments on which the SAT NCKn2 
(reading) level is based is inadequate. It cannot allow the valid inter 
and intra-individual discriminations on which differentiation of the 
curriculum depends. The claim that SATs can provide 'formative, 
summati\'e. evaluati\'e and informative* assessments concerning both 
standards and progress in reading of indi\*iduals and groups at KSl is 
highly questionable. 

3- Is the collection of ihe.se data a wonhwhile use of pupils' and 
my time? 

'The point of the new tests is to gi\*e you and the teachers an exact 
picture of what your child has learned. Knowing how your child meas- 
ures up against standards will gi\-e you the best idea of his or 
her real progress' (I)FS, 1991, p. 2). 

An understanding of the nature of reliabilities and validities and 
their interrelationships is essential if a user is to understand the merits 
of a reading test or assessment technique. The means of assessing XCEn2 
(reading) SAT appears to be changed each year. This makes compari- 
sons across years virtually meaningless. An inordinate amount of teacher 
time has been swallowed up in obtaining information on reading at 



150 



Reading Standards at Key Stage I in Schools in England and Wales 



KSl using SATS, Many teachers assert that this time could have oeen 
more constructively used in teaching their pupils. 

4, As part of my continuing professional development, what ideas 
do I need to follow up in order to enhance my understanding 
of the development and assessment of children's reading skills? 

Teachers have both an individual and a collective professional respons- 
ibility to conceptualize and control the development of reading to the 
benefit of all their pupils. In discharging this responsibility, never forget 
the uniqueness of the individual pupil. Remember that generalizations 
deriving from the reading attainments and progress of groups frequently 
'decay' when one is concerned with understanding the individual pupil's 
instructional needs. 

As one component of the broader field of literacy, an ongoing 
awareness of current research into the nature of reading and its devel- 
opment is essential. This should be matched by an enhanced under- 
standing of the technicalities involved in reading-test construction, the 
assessment of both reading standards and progress, and the interpreta- 
tion of the results obtained. 

When it comes to the assessment of cognitive, affective and mo- 
tivational aspects of reading development, beware the bureaucratic 
'tail' attempting to wag the pedagogic 'dog'. Above all, members of the 
teaching profession must develop their professional expertise and use 
their critical acumen in this and in all other curricular fields. 



Conclusion 

On 29 Ma\ 1992, a report in the Times Hducational Supplement was 
headed \Scols SATs abandoned'. Apparently the Government proposed 
dropping national tests for 8 and 1 l-year-olds following widespread 
objections from parents and teachers. 'In a consultation paper issued by 
the Scottish Office yesterday, it was conceded that criticism had led to 
the Government reviewing its position.' A more efficient and cost effec- 
tive strategy involving the use of teacher assessment linked to Con- 
firmatory Threshold Testing, as in Scotland, could provide the formative 
information on reading development tiiat teachers re(|uire. 

Theoretically based empirically grounded systematic observation, 
assessment and testing are important professional skills whereby chil- 
dren's reading attainments and progress can be appraised. The three 
major complementaiy approaches involve systematic obser\ations by 
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experienced members of HMI (Rose, 1992), the use of objective reading 
tests and, currently, the National Curriculum reading SATs approach. In 
relation to .s7/mw^(/u'e assessment of standards and progress, normative 
(and domain-referenced mastery) objective reading tests have import- 
ant roles that have largely been neglected to date. The history of effi- 
cient national monitoring of reading standards based on standardized 
objective reading tests organized by the then Ministry- of Education that 
was operative here for many years, has either been forgotten, ignored 
or dismissed (Punifrey, 1992). 

In the Autumn of 1991, the results of the National Curriculum KS 
1 assessment in England were published in the form of 'league tables' 
for English, mathematics, science and reading (NO En Profile Compon- 
ent 2), The process was repeated in 1992 (DfE, 1992). Based on the 
SATs reading results, the percentages of KS 1 pupils at levels 2 and 3 
have increased. This has l')een claimed by the Government as an indica- 
tion of an improvement in reading standards and a vindication of its 
curricular policies and assessment procedures. In view of the inherent 
instabilities within the reading SATs procedures and the changes in 
content and procedure that have taken place between 1991 and 1992. 
this claim is suspect. 

Despite unprecedented expenditure. including a Hood of publica- 
tions, SEAC has not met its brief in connection with the core NC Profile 
Component of Reading, If valid infcMination on standards and progress 
is required, the form and content of vSATs require drastic amendment. 
On the basis of the information collected at KS 1 using reading vSATs, 
we cannot know whether reading standards of the individual, tiie .scliool. 
the UrA. or the national cohort are slal'>le. rising or falling. 
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Chapter 10 



Teachers as Participants in the 
National Reading Examinations 



E. Meiselles 



Summary 

In response to some of the criticisms against recent national 
examinations in reading. Israel's Ministry of Education has 
embarked upon two conceptual changes in the design of new 
examinations. The first is that the new national examination will 
be in the form of a communicative language test and the sec- 
ond, that the school teachers will participate in determining the 
standards and the content outline of future examinations. Teach- 
ers' questionnaires and the analysis of responses are the basis 
for the advocated changes. It is concluded that the traditional 
multiple-choice standardized tests liave had a negative effect on 
the teaching of reading. 



Introduction 

Israel's 1990 public examination was designed to re-estal')lish the reading 
ability of the fifth-year junior school children and identify those who 
had not mastered, as yet. the basic reading skills expected. (Children's 
level of competenc y was determined by a commissioned panel of *ex- 
perts' who composed and validated the standardized test. The pro- 
posed paper consisted primarily of short-graded reading passages 
followed by closed-ended (|uesti()ns. which required the selection of a 
X'orrect' response out of the four alternatives offered for most c|"estions. 
Students failing to choose any three or more correct answers o it of tlie 
first fifteen criterion-referenced (|uestions were deemed incompetent 
readers, regardless of their overall raw score in the whole sixty cjues- 
tions test. Indeed, in many cases the total scores of the so-called under- 
achievers exceeded those of their competent' counterparts, which 



E. Meiselles 



challenges the assumed predictive validity of the latest public examina- 
tion (Nevo, 1992). 

A further study of the public examination recently administered, 
also reveals: firstly, a ver>' narrow cross-curricular repertoire of short 
reading texts depicting by and large, descriptive genre characteristics; 
and secondly, abrupt switches from one topic to another, which means 
that any attempt to maintain a reasonable communicative sense be- 
tween the writer, the reader and the message (text) was, to say the 
least, accidental. Indeed, more than half of the reading passages offered 
are between one and three sentences long. These sharp changes there- 
fore recjuire irregular refocusing, which is communicatively unreal and 
very exhausting. 

The main criticism however, has been directed at the test's heuris- 
tic implications that reading is a collection of staged skills which can be 
passively tested. All the reader needs to do is identify the recjuired skill 
and guess the correct alternative. In other words, literary competency 
has been redirected to the control of routinized content-related skills, 
with little or no reference to structurally related strategies of unfolding 
meaning and identifying genre characteristics for communicative pur- 
poses (Valencia and Pearson, 19H7). After all, public-accountability ex- 
aminations, argues Cohen (1987), dictate in most cases what aspects of, 
and how any subject is taught. Whether we like it or not, wlia; is 
eventually taught and what is tested are intimately related. 

The Ministiy of Education and Culture in Israel has recently em- 
barked on the participation of teachers in the construction of the next 
public reading examination in Summer 199 i. The aims were twofold: 

• to narrow the widening gap between the more progressive 
teaching approaches encouraged in recent years in the literacy 
classroom and the demands of the traditional public examina- 
tions and the presentation of their scores; and 

• to reinstate the diminishing role of teachers as assessors for 
formative purpo.ses. and leduce by default, the overempliasis 
on summative assessment. 

The reason for this shift and the partial outcomes of the proposed 
venture are considered. 

Traditional Te*^tiiig and Curricular Misalignment 

The two key as.sumplions underlying most standardized testing tech- 
nology and practice, are ckr()m/)()sahilify'And (UxotitcxtmlizutixiU. Tlie.se 
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assumptions were compatible with the rouiinized skill goals of the past 
and with the psychological tlieories of the first pan of this centur>\ 
They are however, incompatible with what we know today about the 
nature of human cognition and learning (Johnston, 1990). Psycholog- 
ical theories of the 192()s assumed that thought could best be described 
as a collection of independent pieces of knowledge. This decompos- 
ability assumption can be clearly recognized in the behaviourist work 
of psychologists such as Fdward Thorndike and Hurrhus Skinner, which 
has had a profound influence on the instruction and testing of reading. 

Oecontextuaiization. the second major assumption built into stand- 
ardized tests, asseiis that each component of a complex skill is a fixed 
entity that will take the same form where\'er it is used. If .students know 
how to distinguish a fact from an opinion for example, they can do so 
under all conditions. It makes sense to select key critical-thinking skills 
for deconiextualized practice in schools; but this assumption no longer 
appears valid. Developments in the episteniology and phik)sophy of 
science show that there is no absolute line between fact and theory, 
data and interpretation (Lakatos, 197H). 

Consecjuently, most of the current .standardized tests are severely 
wanting and they are tuned to a curriculum of the past. Kven in school 
districts with an official policy against teaching for tests, considerable 
attention in the press or elsewhere to the test scores causes teachers to 
readapt their teaching and match their curriculum to the national tests 
(Leinhardt and Seewald. 1981). Fuilhermore, the public accountability 
systems of assessment for the purpose of reallocating ftmds had cen- 
tred on summaiive rather than formatlN'e evaluation. 

Public-testing practice remained essentially unchanged from the 
era in which it was considered enough for schools to teach the mastery* 
of routine skills by reading predictable texts (Fuhrnia, 19HH), Yet em- 
pirical e\idence has shown that: 

• children cannot understand what they read without making 
inferences and using infonnation that goes beyond the written 
text. 

• facts accjuired without structure and rationale disappear (juickly. 

To gauge the extent to which the decomposition and decontextual- 
ization assumptions permeate today s achievement tests, Shavelson and 
Stern ( 19H1 ) for example, examined the standiuxlized-test batteries wideU' 
u.sed in educational assessment as part of mandated testing programs. 
They found that reading-comprehension tests generally present short 
passages, together with multiple short (luestions. In asking for bits of 
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information rather than interpretation of an extended passage, these 
tests reflect the decoinposabihty assumption, treating knowledge and 
skills as accumulations of isolated pieces of information and not as 
coherent, interactive systems. In addition, these tests encourage quick 
finding of answers rather than reflective interpretation, allowing each 
examinee an average of five to six miniues to read a series of brief 
passages and answer five to eight questions per passage. Although the 
tests require a degree of textual interpretation, their isolated questions 
rarely examine how students interrelate parts of the text and do not 
require justifications that support their interpretations. In other words, 
the nature of the questions and the speed with which they must he 
answered do not invite the kind of reflection and elaboration demanded 
by the newer communicative approaches to reading. 

In summary, these tests tacitly convey a definition of reading as 
pursuing short passages to answer other people's questions. Furthermore, 
their format suggests that the answers to these questions are already 
known by the person asking them. Under such circumstances, reading 
comprehension appears to be a matter of finding predetermined answers. 
We need a much wider cross-curricular repertoire of reading texts de- 
picting different genre characteristics. The reader should ultimately 
demonstrate the use within the context of the immediate text and beyond 
it, empk)ying for example, the rhetorics of persuading, arguing, reporting 
etc. Applebee (1978) notes for instance, that first-year junior pupils can 
be taught to handle skilfully narrative characteristics and at the age of 
10 they can sustain a structured argument, express ()pink)ns. compare 
and contrast, and even persuade (Stein and Cilen. 1979) (see also 
Chapters 7 and 8 in this volume). 



Messages for Teachers 

In response to the recent changes in tlie teaching of reading and the 
traditional approach to testing, a dynamic model of formative assess- 
ment' designed to complement the communicative-process approach to 
the teaching of meaningful reading had been suggested (Champion 
and Brown, 198S). Their heuristic model is ba.sed on the Vygotskijn 
concept of the proximal development /one* — the area of funher 
advancement to be accomplished by employing cues of reflective inter- 
pretation, the justification for it and its eventual re-application beyond 
the text s framework. This nonlinear process approach to literacy must, 
in their view, be progressively enriched by: 
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• prior world knowledge such as, exposure lo information, rhe- 
torical characteristics and expressive cues, related to the texts 
in (|uestion and beyond them; and 

• applying what is read and/or written to other areas of know- 
ledge and to different communicative situations across the 
curriculum. 

The 'dynamic model of formative assessment' should therefore 
address its efforts to those strategies of imfolding meaning and using it 
in differently related communicative circumstances. Readers are thus 
expected, in other words, to demonstrate their comprehension by re- 
sponding to a given text in varied situations. This redepk)yment and re- 
application of communicative characteristics and skills be it in writing 
or orally, is a more xalid approach to assessing reading than the closed- 
ended multiple-choice questions traditkmally used (see for example 
Calfee, 1987). 

Guided by the growing e\'idence that Israels teachers are by and 
large, more attuned to the recent changes in teaching methods and 
testing of reading, than the so-called public-examinatk)n experts, the 
chairman of the Pedagogic Institute formed an acJ hoc steering com- 
mittee.' Also invited to participate was a repre.sentative body consisting 
of more than fifty teachers from varied disciplines and different schools 
across the country', in order to compose the next public reading-cum- 
wriling examinatk)n due lo take place in Summer 199-k This select 
group of teachers is responsible for the collection of varied and relex'ant 
reading passages of interest to their fifth-grade students. They are also 
reciuested to .suggest valid ways of testing their student.s* reading ability 
in communicative terms with particular reference to: 

• a wider cross-curricular repertoire of different types of texts; 
and 

• a wider range of te.stable strategies akin with unfolding mean- 
ing and using it in differently related comnumicative situations. 

Several examples of ([uestions attempting to test reading in com- 
municative terms along the lines advocated above, are presented below: 

1. The reader is asked to act as a policeman reporting a theft 
which took place on the beach, as eventfully described in the 
reading passage proposed. 

2. The reader is encouraged to write a thank you letter to a life- 
guard on behalf of a boy whose life was .saved by the lifeguard, 
as vividly described in the poem. 




/59 



H. Meisellcs 



3. The reader is invited to defend the choice of trax elling by a 
private car instead of a train to a Summer resort, by reconsid- 
ering the advantages and disadvantages of clifferent means of 
transportation as reported in the local newspaper. 

4. The reader is requested to anticipated the problem of weight- 
lessness on the moon s surface, and give a personal account of 
this imaginary' experience. 

5. The reader is rec|uested to outline the effect of freezing water 
in a sealed bottle with reference to: 

• the purpose of the experiment; 

• the materials used; and 

• the outcome of the experiment. 

(). The rcLider is encouraged to persuade the incoming tourists to 
the Hohiand to \'isit the Sea of Galilee as a significant histori- 
cal site. 

In the sample presented above the texts are N aried and like many other 
types of writing from different sources, submitted for pre-test selection 
and re\iew, they comprise writing assignments designed to demon- 
strate both functional understanding and communicative applicability. 



Conclusion 

It is still loo early as yet, to assess the validity of the proposed interpre- 
tation of the communicative approach to testing. What is cjuite ceitain 
however, is that the traditional standardized tests are not only inadec|uate, 
but they continue to have a negative effect on the teaching of reading. 
The departure from the traditional decomposabiliiy and deconiexlual- 
izalion concepts in leaching and testing, has been gradual and one- 
sided, rhc shift from leaching reading as separate staged skills lo an 
interactive communication, has not found its fullest expression in test- 
ing reading. Consecjuenity. the gap between teaching and testing has 
not narrowed. The testing model reviewed above has been designed to 
reckuc ibis conceptual mismatch betwuen teaching and testing. The 
principles of a dynamic model of formative assessment* haw been 
employed and integrated with the strategies of unfolding meaning and 
using it in differently related communicative situations. 
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Notes 

1. All schc)()l teachers in Israel are encouraged lo take a one-year fully paid 
sabliaiical every six years, tor an advanced course in education. Fuilliennore, 
regular in-seivice teacher training is remuneratively attractive. 
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Aims and Activities in Two Countries 
Participating in the lEA 
Reading-literacy Study 



I'. Froese 



Summary 

This chapter compares and discusses teacher strategies, aims and 
activities in two jurisdictions, British Columbia in Canada and 
the I SA in the International Association for the Evaluation of 
Educational Achievement (IFA) Heading-literacy Study, British 
Columbia espouses a child-centred literacy curriculum, whereas 
the majority of states in the TSA use matc^rial-centred reading 
approaches, known as basal readers, Aspect,s of teaching as 
well as student achie\*emciit are compared for the 9 year-old 
populations across the two s\*stems. There is evidence of im- 
portant differences in the material-centred and the pupil- 
centred jurisdictions with respect to teachers* aims, strategies 
and activities. However, the answer to but who did best?' is not 
simple, hidications are that similar test achievement can be 
obtained by (juite different educational curricula. 



Introduction 

The purpose of this chapter is to compare and discuss teacher strat- 
egics, aims, and acti\'ities in two jurisdictions pailicipating in the li^A 
Reatling-literacy Study: British Columbia in Canada and the I'nited States, 
These two jurisdictions are of particular interest since Ikitish Columbia 
(BC) espouses a child-centred literacy curriculum whereas the majority 
of states in the I *S,A use material-centred reading approaches known as 
ba.sal readers, Ik*cause of space limitations only the younger age groups 
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(Population A) are considered here. Before specifically addressing the 
strategies, aims, and activities it is necessary lo consider the context in 
which the information was gathered. 

Overview of lEA Reading-literacy Study 

In 1986 the International Association for the E\'aluation of Educational 
Achievement (lEA) initiated this literacy study and an international steer- 
ing committee first met in Washington, DC in 1988. A tentative blue- 
print was mapped out b>' representatives from thirty-five participating 
systems (countries in most cases), Warwick Elley was appointed chair- 
person of the steering committee and an International Coordination 
Centre imder the direction of Ne\'ille Postlethwaite was established in 
Hamburg, Germany to facilitate the study. The steering committee was 
responsible for item development, for questionnaire de\'elopnient. tor 
pilot testing (1989-90), and for the final selection of items. 

The actual test administration for the lEA Reading-literacy study 
was completed in 1990 in the southern hemisphere and in 1991 in the 
northern hemisphere. The goals of the study were as follows: 

1. to produce \'alid international measures of reading literacy: 

2. to identify the proportion of students reaching various levels 
of literacy: 

3. to show the frec|uency and use of reading literacy: 

i. to identify factors influencing literacy in students, teachers. 

and schools: and 
S. to establish a 1991-2 database for future comparisons. 

Although initially it was intended to assess literacy in the general 
sense this was abandoned due to complexity and cost for the more 
focused reading literacy* which was defined as 'the ability to under- 
stand and use those written language forms re(juired by society and/'or 
valued by the individual and the focus was on material.', commonly 
found in the home, school, community, and workplace in all countries. 

Students to be included were generally 9 and l i-year-olds but the 
.sample was defined as all students attending on a full-time basis at the 
grade level in which most students aged 9:00-9:11 years (or 14:00- 
l i:l 1 years) were enrolled during the first week of the eighth month of 
the school year. The younger age group was referred to as Population 
A and marked the end of primaiy school, and the older group known 
as Population B marked the end of compulsory- schooling in most 
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countries. Due to the nature of the definition about half the participat- 
ing systems tested in grades three and eight (including BC), the other 
half at grades four and nine (including the L-SA), and one system at 
grades five and ten (New Zealand). All told some 93,039 students in 
4,353 schools at Population A and 117,020 students in 4J2() schools at 
Population B participated in the study. Teachers numbered 4,992 at 
Pop. A and 5.526 at Pop. B level. More specifically BC tested in 157 
schools in Pop, A and 197 in Pop. B: the I'SA tested 165 in Pop, A and 
165 in Pop, B. 

The systems participating in the lEA reading-literacy countries were: 
Belgium. Botswana, Canada (BC), CypRis, Denmark. Finland. France. 
Germany. Greece. Hong Kong. Hungary-. Iceland. Indone.sia. Iceland, 
Italy. Ireland, The Netherlands. New Zealand. Nigeria, Norway. The 
Philippines. Portugal. Singapore. Slovenia. Spain. Sweden. Switzerland. 
Thailand, Trinidad and Tobago, The I'nited States, Venezuela, and 
Zimbalnve. 



Method 

The histnimoits 

Reading-litenicy tests were developed for botli Populations A and H. 
For Population A, a word-recognition test with forty matching items 
was included as well as a number of narrative, expositors, and docu- 
ment passages with accompanying multiple-choice, short-answer, and 
long-answer responses. Population B had narrative, expositoiy. and 
document items but no word-recognition component, Kach group ha'.i 
two testing sessions, approximately seventy-five minutes for the younger 
age group and whoux eighty-five minutes for the older group, A student 
(juestionnaire was untinied and paced by the teacher. The question- 
niiire included personal information and interests, details about home, 
about reading habits, about homework, and about the reading instruc- 
tion they were receiving. 

The classroom teacher completed a te*acher questionnaire which 
included information about the teachers e^iucational training, about 
the class being tested, about teaching activitie^s. al)()ut aims of reading 
instruction, about the classroom and school librarv*, and abcuit school 
organi/.ation. Additionally a school principal's (juestionnaire was admin- 
istered to document the principal's training, community resources, facts 
about the school lil^raiy, instructional time, and about the principal's 
role. 
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Finally, a national case-study questionnaire was completed to pro- 
vide information about the cost of schooling, the structure of the school 
system, student enrolment, the reading curriculum, gender statistics, 
teacher training, and so on. Some of this information was used to de- 
vi.se a Compv)site Development Index (GDI) which was later used to 
interpret reading-literacy outcomes. 

VnUjtie Features of the IFA Literacy S/iuly 

Since it had been approximately twent\' years since the last large-scale 
international Pv^acy study had been undertaken by Thorndike (19"^3), 
it was natural to expect that curricula in the various countries might 
have changed as well. As a result considerable debate surrounded the 
content of the instruments to be used. Particularly evident was the 
realization that the teaching of literacy had progres,sed in some coun- 
tries from an exclusively bottom-up (i.e.. skills) approach to a modified 
top-down (i.e.. meaning-based) approach. The final result, after much 
debate, was li fiiir compromise. Some of the unicjue features of the 
resulting instrument follow. 

1. Common piissages 

Three common pa.ssages (Temperature, Marinot. Shark) were 
embedded in the Population A and the Population B tests to 
give some indications of growth in ability to answer compre- 
hension ({uestions between age groups. In fact, an approxi- 
tiiate I k per cent growth was obserxed in the five-year age 
difference of the two groups. 

2. Passage length 

Since there is some indication that reading behaviour changes 
with su.siained reading it was agreed that both short and long 
pas.sages would be included. Long passages were defined as 
those recjuiring more than two minutes to read, and because 
of .space and time limitations only two were included at each 
age level. In Pop. A pas.sages ranged from about forty to over 
*^()() words, in Pop. B from about ISO to over 1, ()()(). 

3. (Question types 

vSince con.sidenil^le criticism has been expressed over an exclu- 
sive reliance on multiple-choice questions, other respon.se 
modes were included. 'Hiese included two types of open-ended 
items — some re(|uired word responses to be written, others 
rec|nired several sentences. Other items recjuired students to 
respond in a variety of ways to ditvctions. 
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4. Text type 

Three types of text were presented: narrative, expository, and 
document. Narrative text was defined as Cv)ntinuous text in 
which the writer s aim is to tell a stor>' — whether fact or fiction. 
Expository text was continuous text designed to describe, 
explain or otherwise conxey information or opinion to the 
reader. And documents were structured information displays 
presented in the form of charts, tables, maps, graphs, lists or 
sets of instruction. 

5. Passage content 

Because reading literacy was defined as the ai:)ility to under- 
stand and use those written-language forms recjuired hy soci- 
et\' and/or valued by indix iduals, it was decided to include a 
balance of passages based on the themes of home, school, and 
society/work. For Pop. A there were passages about animals, 
pets, family, school timetable, and a postcard. At the Pop. B 
le\'el there were passages about animals, job vacancies, bus 
schedules, humour, and medicine directions. 
(). Participation 

The lEA reading-literacy study was one of the largest of its 
kind, it involved thirty-two school systems from around the 
world. Most systems represented countries hu\ some like Cier- 
many (West) and Germany (East) were the result of political 
changes during the course of the study. Others were different 
because they represented only parts of countries: Belgium 
(French) and Canada iWC), In tiie latter case, only the Prcn ince 
of Britisii Ciolumbia. one of ten provinces in (Canada, particip- 
ated in the study. Twenty-one language groups were involved 
including nine countries that tested in English: Canada (B('). 
Botswana, Ireland, New Zealand. The Philippines. Singapore. 
Trinidad and Tol^ago. The I'nited States, and Zimbabwe. 
Statistical Analysis 

Extensive pik)t testing preceded the selection of pas.sages and 
items. Passages were ixick-translated to assure accuracy. Pilot 
testing allowed the selection of items that performed similarly 
across translations in the twenty-one different languages that 
were involved. A statistical scaling technkiue known as the 
Rasch procedure was used to produce scales for each domain 
witli a mean of SOO and a standard deviatkm of 100. This 
allows the comparisons of scores on a common scale across 
countries, regardless of the language of testing. Probability 
sampling was under the direclk)n of an international sampling 
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coordinator to assure thai fair sampling procedures were fol- 
lowed in all countries. 
8, Student, teacher, school information 

The extensive information gathered about students, leacliers. 
and schools (described earlier in this chapter) allows a variety' 
of in-deplh analysis of factors that affect achievement. It is 
possible, for example, to look at leaching strategies thai lead 
to high achievement; to examine which school and commun- 
ity attributes are related to achievement; to see how aims of 
instruction affect achievement; and so on. From the student's 
perspective it is possible to examine how aspects of the school 
and home environment affect achie\'einenl. The data also gives 
insight into the frequency and use of reading literacy in today 's 
world. 



Results 

Stude}it-ce)itred rs Material-centred I)istructio)i 

At the outset it was staled that this chapter intended to look at results 
from the IF.A reading-literacy study from the perspective of two jurisdic- 
tions with contrasting curricular approaches, British Columbia espouses 
a child-centred curriculum whereas a majority of the Tniied States use 
the material-centred reading approaches known as basal readers. 

While it is difficult to pin down exact numbers, a study of I ().()()() 
elemeniar\' teachers in the I'SA in the 197()s found that per cent of 
them relied on basal materials (Education Product Information Hxchange. 
1977); a later estimate was 90 per cent (Cioodman. Shannon. Freeman 
and Murphy. 1988). Kssentially these basal or reading series consist of 
a graded set of textbooks for each of the elementary grades and 
accompanying workbooks, a teachers manual which provides in- 
structions as to what the teacher should do in each lesson, and various 
supporting materials (i.e.. charts, big books, tapes, etc), and testing 
materials which closely emulate the instructional materials. Instruction 
based on such materials has dominated reading instruction in the I 'nited 
Stales tor over fifty years. 

In Hrilish Columbia many teachers used similar materials until re- 
cently, but by 1991 only per cent reported u.sing them, liome 23 per 
cent indicated using a literature-based or Whole Language approach, 
and per cent indicated tliat they used eclectic approaches. A Royal 
Commission on Ixkication ( 198*^-8) conducted an extensive review 
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of BC schools and its report A Legacy for Leanios made eighty-three 
recommendations dealing with a wide range of topics, including cur- 
riculum, assessment, reporting, and learning. Three key points about 
learning were intended to shape the programme: 

1. Learning requires the active participation of the learner, 

2. People learn in a variety of ways and at different rates, 

3. Learning is both an individual and a ,social process, (BC. Min- 
istr>' of Education: 1989). 

The publication also stated fi\'e goals of the primar\- programme: 

• aesthetic and artistic development; 

• emotional and social development; 

• intellectual development; 

• physical development; and 

• social responsibility. 

One paragraph simply describes what was intended: 

Teachers phm learning experiences based on the guiding 
principles of the program. They obsene the /cY^/v/c^r carefully 
within the context of daily learning. They talk with the stiuJcnt 
in conversations and conferences and listen to what the child is 
saying about his or her learning. They examine collections of 
each stiuicutswovk. Teachers encourage stucloitsxo represent 
their ideas in many different ways, including painting, role play- 
ing, writing, constructing, manipulating objects, etc. Through 
these processes teachers. o\*er time, begin to get a picture of 
what the stucknit cwn do and can make informed decisions about 
future learning hny emphasesi, (BC Ministry' of Education, 1989. 
p. O 

Even from this brief description the reader can see that the two 
systems are (juite different in terms of who makes the decisions. In the 
basal programmes most decisions have been made by the 'experts' 
guiding the material development and are incorporated into the direc- 
tions provided for the teacher and .student. In the BC curriculum the 
classroom teacher is expected to iiiake the decisions based on 
contextuali/ed obsen'ation in real learning situations. 

The IHA reading-literacy ,study in 1991 provided an opportunity to 
observe how teachers in the contrasting systems saw their roles (i.e.. 
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reading aims), what types of classroom exercises they favoured (i.e., 
reading activities), and what sort of methods they preferred (i.e.. read- 
ing strategies). 



Teacher Strategies, Aims and Activities 

The teacher questionnaire provides considerable information about the 
teachers* educational training, the class being tested, leaching proce- 
dures, the classroom lit")rar>\ and school organization. Further, the study 
makes it possible to examine the connection of these various attributes 
with achievement on the reading portion of the test. Because of space 
constraints, this chapter considers only the teachers' aims, instructional 
strategies, and assigned reading activities. 



Teachers were asked to rank their top five aims of reading 
instrLiction of twelve items presented. Note that because not 
every teacher ranked all items only relative differences should 
be considered in Figures 11.1 and 11.2. 

2. Instructional strategies 

Teachers were asked to indicate how often they used each of 
the thirteen instructional strategies when teaching reading. 
Choices were almost e\*ery day, about once or twice per week, 
about once per tnonth, or almost never. The findings are pre- 
sented in Figures 11.3 and 11.4. 

3. Reading activities 

Teachers were asked to mark how often students were typi- 
cally involved in a list of twenty-eight reading activities. Again 
frecjLiencies were descritx'd as almost dail\', about once or twice 
per week, about once per month, or almost never. Data are 
presented in Figures ll.S, 11.6. and 11.7. 

To e.stalMish a sense of importance of the various aims, strategies, 
and activities, a iialf do/en doctoral students in a readitig-education 
programme at the Cniversity of British Columbia were asked to rate 
them as to whether they involved high, middle, or low levels of cog- 
nitive activities. Figures 1-7 present comparisons for the two jurisdic- 
tions (IK! atui I'SA) grouped by level (high, middle, low). The reader 
may wish to impose some other order on iliese items, but it was fell to 
lie a useful way of considering tliem. 
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Figure 11.2. 
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Since it is inslniclive lo determine how students in the two juris- 
dictions performed on the literacy test it was necessary to complete a 
supplementary growth study (Froese and Conry, 1993) since grade- 
three students were tested in BC (average age 8.9), grade-four students 
in the USA (average age 10.0). and grade-five students in New Zea- 
land (average age 10.0). Figures 11.8, 11.9 and 11.10 illustrate the 
international-standard scores achie\'ed (mean = SOO, sd = 100) in the 
N'arious domains on the literacy test by BC students, American students, 
and for added interest two other countries te.sting in English: New 
Zealand and Ireland. It should be remembered, howe\'er, that the growth 
study was conducted in 1993. exactly two years after the original lEA 
study. 

Messages for Teachers 

As can be seen from Figure 11.1. wlien considering those aims consid- 
ered lo be high' cogniti\'ely. five of nine aims are ranked more highly 
by BC> teachers. These are: 

• develop interest in reading: 

• develop research and study skills: 

• deepen emotional understanding: 

• expand reading choices: and 

• make reading enjoyable. 

BC' and l.S teachers are \'irtually tied on developing critical thinking 
and expanding workl \'iew. The I *S teachers rank improving reading 
comprehension and extending vocabulaiy more highly. In Figure 11.2. 
in the iow/medium' le\'el activities, only one. improving word-attack, is 
rated higher by BC teachers, the others are rated higher by US teachers. 
Why the differences? An understanding of basal series makes the finding 
fairly apparent. The typical pattern suggested in the manuals is to in- 
troduce a sloiy by focusing on vocabulary' that is intended to be new", 
to read the selection silently, answer comprehension (|ue.stions from 
the manual or workbook, and discuss the selection (and have students 
read orally selected passages) and correct the (|uestions. The student- 
centred curriculum in contrast integrates the language acti\'ities with 
the learning of other content, whether science, social studies, or litera- 
ture. It emphasizes the heuristic function of language, but also stresses 
the making of choices, and the social nature of learning. 

I'igures 11.3 and ll.i focus on reading strategies. Again, when 
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considering the iiigh-Ievel* strategies, seven of ten are used more fre- 
quently by BC teachers; of the three *low/niiddle* strategies two favour 
the US teachers. As can l")e seen from Figure 1L3, BC teachers fre- 
quently assess prior knowledge, describe strategies used, get parents 
involved, compare stories and poems, read aloud to class, have parents 
reading to students, and encourage use of the library'. The US teachers 
favour questioning to deepen understanding and to model understand- 
ing — both again closely related to the basal methodology. It is particu- 
larly interesting to note the importance placed on involving the parents 
by the BC teachers. Also, the strategies appear to reflect the wider 
range of materials read, the adaptation of strategies to reading different 
materials, and use of reading for learning. Figure 11.4 indicates that the 
I'S teachers use graded sequence of texts and ask comprehension 
questions more frequently whereas BC teachers u.se self-made materials 
more frequently. This appears to reflect the different methodologies 
quite plainly. 

The next set of figures. Figures 11 5, 11.6 and 11.7. give an indica- 
tion of the types of acti\'ities teachers indicate that they use in the 
classroom. Ten of the fifteen iiigh-level* activities are used more fre- 
quently by BC teachers. These include .silent reading in class, inde- 
pendent reading, discussing books, incidental vocabulary' development, 
reading play's or dramas, drawing in response to reading, reading other 
.students* writing, making predictions, having students lead discussions, 
and writing responses to leading. IIS teachers use summarization, making 
generalizations and inferences, and reading in the subject areas more 
frecjuentiy. To some extent the latter are more routinized and rellect a 
more structured common set of readings, while the former indicate a 
range of individually selected materials which have been individually 
selected by students. Figure 11.6 indicates that of the nine 'middle- 
lover activities. se\'en are used more fre(|uently by BC teachers: oral 
reading in groups, teacher reading to class, reading games, dramatizing 
stories, diagramming stories, studying the .style and structure of materials, 
and comparing pictures and text. US teachers use writing comprehen- 
sion answers more freciuently. Of the 'low-lever activities li.sted in 
Figure 11.7. the US teachers use three of five more fre(|uently; that is. 
word-attack, oral reading, and vocabulaiy lists. It is almost ironic tlial 
letter-sound relationships are given more attention by the BC teachers 
when the basal-reader methodology typically stresses phonics. It may 
be that vocabulary control actually reduces the necessity to apply the 
very skills that are intended to be learned. 

After pointing out the above differences it is common to be asked; 
But who did better? This naturally depends on which criteria one uses. 

Ibi) /7? 



Figure 1 1.5: Reading activities — USA vs BC: High level (Pop. A) 




Figure 7 1.6: Heading activities — USA vs BC: Middle level (Pop. A) 




Figure 1 1 7: Reading activities — USA vs BC: Low level (Pop A) 
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or what one values. Figures 11.8-11.10 provide some brief comparative- 
achieveineni information for the three domains tested-docunient, 
expository', and narrative. In general BC students and US .students per- 
formed very- similady when comparing grade-four students on the in- 
ternational .standard scale. Both jurisdictions performed slightly better 
than Ireland, a country also testing in English, and when comparing BC 
grade-five .students with New Zealand grade-five students the former 
achieved significantly higher levels. 

Figure 11.11 indicates the growth of achievement of BC .students 
from grade three to grade ten when responding to the same passages. 
Steady growth is seen imtil grade eight, then plateauing occurs partly 
becau.se students are topping out on the measures used. Growth (^f 
about 13 per cent is experienced betv^'een grades three and four on the 
narrative, expo.sitory^ and document items when con.sidering interna- 
tional standard scores. This is a radical contra.st to the growth found by 
Elley (1992) of about 5 per cent. The growth between grades four and 
five tapers off considerably to 5.5 per cent for narrative, 4,5 per cent for 
expository, and 5.6 per cent for document items. The answer to But 
who did better?' is not .simple. In fact indications are that similar achiexe- 
menl can be oi')tained by cjuite different educational curricula. How- 
ever, insofar as the data on teachers* aims, .strategies, and activities 
reflect their methodologies, there is evidence of important difference in 
the material-centred and the .student-centred juri.sdictions contrasted in 
this repon. 

Conclusion 

The reader may find further information about the IFA reading-literacy 
.study in publications by Hlley (1992). Po.stlethwaite and Ro.ss (1992). 
and i^y Lundberg and Linnakylii (1993). It is important to note, how- 
ever, that compari.sons in these publications are made across countries 
regardless of the age or grade level of the .students. As indicated by the 
WC growth .study, and by previous data found in a Swedish .study, grade 
level is a .significant determinant of acliievement and should not Ix* 
overlooked when making compari.sons among countries. As indicated 
in a recent article (Stahl. Iliggin.son and King. 1993). we need to l)e 
cautious aixHit 'misinterpreting or misrepresenting cro.ss-national or cro.ss- 
cultural findinj^s. and judging the results of a nation s literacy as.sess- 
menl as an OKmpic event in a worldwide game of pedagogy'. 
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Figure 1 1 8: Document-reading: test companson (Pop. A) m selected countries 




Source BC Growth Study 1993 



Figure 11.10: Narrative-reading: test comparison (Pop. A) in selected countries 




Figure 11.1 V BC students' growth of achievement in response to same passages (Pop A and B) 
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Note by Editors 

Because literacy enriclies the indi\'idual and the State, botli culturally 
and economically, it is not surprising that the IFA reading-literacy study 
has aroused considerable international interest and controversy. Explicit 
and public accountability for the considerable in\'estiiient made b\' coun- 
tries in their state educational systems is a political high priority. 

Increasingly, the nations of the world acknow ledge their member- 
ship of a global community in wiiich children are entitled to an edu- 
cation. The mass-media have made it possible for more people than 
ever before to gain an appreciation of life in other cultures, including 
the variety of their educational systems. The relative standards of lil- 
erac\' of pLipils b<nng educated in different social and economic con- 
texts and un(.ler a variety of pedagogic philosophies and their related 
curricular activities, are of both theoretical and practical importance. 
This has been demonstrated in the present chapter. 

Quite clearly, the assessment of reading attainments and the com- 
parison of standards across thiily-two school systems poses important 
and challenging conceptual and empirical problems to the test devel- 
oper. The twenty-one different languages invoked in the IFA study 
further compounded these difficulties. Interpreting the results recjuires 
expertise. 

To assess the standards and progress in literacy' of tiieir pupils and 
also to identify and allexiate individual difficulties, teachers in many 
countries use a wide range of tests and assessment techni({ues, Fre- 
cjuentK , these instruments are de\'ised and constructed by specialists in 
measurement tiieory and practice. To the majorit\ of teachers, test 
theor\' probal')l\ has the appeal of fillet steak to a \ egetarian. Whereas 
there are dietaiy substitutes for meat, an understanding of test theor\' 
is essential to pn^fessional accountability in education, 

hi the interests of the pupils and parents that the teaching profes- 
sion exists to serve, it is important that its members become familiar 
with the concepts underpinning the Rasch model on which the IFA 
reading literacy study reading scales in each domain w ere constructed. 
If the results obtained are to be understood by the teaclier, the assump- 
tions and rationale underpinning the Kasch model haw to Ik* appreci- 
ated. So too has its strengths and weaknesses. 

Ac(|uiring such understandings re(|uires a willingness of the reader 
to wnture into relatively unfamiliar conceptual areas. The exposition 
presented in the follow ing chapter. b\' the head of the Danish National 
Institute for Fducational Research, provides a fitting continuation and 
technical exposition of ideas cvntral to the present chapter. 
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Summary 

This chapter provides a brief overview of the initial statistical 
analyses of student data employed in the lEA reading-literacy 
study conducted in the years 1990-1. Attention is drawn to how- 
reading theories may relate to the design of the study and the 
consiaiction of test materials. The test booklets and question- 
naires relating to reading related factual information such as 
'number of books in home*, used in the stud>' are described. 
The investigation found that no single variable or set of vari- 
ables can 'explain* variation in children's reading performance 
but certain trends are identifiable. 



Introduction 

It is the aim of this chapter to present a brief over\iew of the initial 
statistical analyses of student data employed in the lEA reading-literacy 
study conducted in the >'ears 1990-1. From the point of view of a 
statistician the intention is, accordingly to emphasi/e merely technical 
aspects raised during the analysis of student data rather than addressing 
theoretical aspects of the process of reading and, ewntually, how read- 
ing theories might be related to the design of the study, the construc- 
tion of test materials, (juestionnaires etc. The following will summari/e 
the design of the study: the construction of three international reading 
scales used for comparing student abilities all over the world: and 
finally how student reading performances were related to background 
factors sampled jointly with the test results. 



The lEA Study of Reading Literacy 



Design and Method 

The Study 

The idea of setting up an international study, like the lEA reading- 
literacy study, originated from the ambition to study reasons' for being 
a high or low achieving student in terms of explaining background 
factors. In fact, many national studies aiming at revealing proper ex- 
planations* fail to succeed, because the variation of background factors 
within a country is limited due to homogeneous educational systems — 
and, hence, a study on an international basis can bring about adequate 
variation in these factors in order to reinforce the relation between 
reading performance and background. 

The study was conducted as a pilot in 1990 and a main study in 
the years 1990-1 on two populations of students: 9-year-olds from 
grades 3/4 (called Pop, A) and 14 -year-olds from grades 8/9 (called 
Pop, B) in thirty-two countries aroimd the world: 



Belgium 


Finland 


Hong Kong 


Italy 


The Philippines 


Sweden 


Vene7Aiela 


Botswana 


France 


Hungary' 


The Netherlands 


Portugal 


Switzerland 


Zimbabwe 


Canada 


Germany(W) 


Iceland 


Norway 


Singapore 


Thailand 


Cyprus 


Germany(E) 


Indonesia 


Nigeria 


Slovenia 


Trinidad 


Denmark 


Greece 


Ireland 


New Zealand 


Spain 


The rSA 





The major aims of ilie study adopted by the National Research 
Coordinators (NRCs) included the following: 

• to describe the voluntaiv reading activities of 9 and i-i-vear- 
olds; 

• to identify differences in policies and instructional practices in 
reading, and to study the ways in which they relate to students* 
achievement and voluntary* reading: 

• to produce valid international tests and cjuestionnaires which 
could be used to investigate reading-literacy development in 
other countries; and 

• to provide national baseline data suitable for monitoring changes 
in reading-literacy levels and patterns over time. 
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The study should also be seen as a continuation of earlier lEA 
reading studies (Thorndike. 1973); it was not initiated from a set of 
strictly formulated reading hypotheses which, like in other smaller 
studies are tested directK', one by one, hx adopting rele\'ant statistical 
tests. The data were collected in all participating countries according to 
sampling procedures accepted by all NRCs and were, later on, sent to 
the International Coordinating Centre (ICC) situated in Hamburg, where 
all subsecjuent data analyses were undertaken. The sampling of data 
was carried out in such a way that conclusions drawn from data C(uild 
be considered as representatixe for students and teachers in the coun- 
tries at the specified grades. Throughout the stud\' period several meet- 
ings between the NRCs and the Hamburg ICC were held and decisions 
as to the analysis strategy', forming of conclusions, publication plans 
etc. were discussed during such conferences. 

The complete te.st materials compri.sed test booklets to the .stu- 
dents, (juestionnaires to .students: que.stionnaires to teachers and .school 
principals and, on local initiatiw, a series of .so-called national options* 
to be added to te.st booklets and ciuestionnaires. The information pro- 
vided by national options \\ as, howcx er. not subject to the international 
cro.s.s-c()untr\' analvses but is now available for further within-country 
analyses, 

Immediatelv after completion of the first international .slati.sticai 
analy.ses on all data, three books ha\ e been published: Haw in the irorUi 
do students rcdd'f (Hlley, 1992), which is the main reference for this 
chapter. Teaching tvadin^ii aronnd the /ror/r/(Lundberg and Linnakylii, 
1992) and Effeetire Schools in reading (Po.stlethwaite and Ko.ss, 1902). 
A book containing detailed descriptions of all technical a.spects during 
the study will appear .soon (Beaton, I99 t). 

The international books will be succeeded by a number of national 
reports in the future, after the data w ere available to all countries as a 
con.secjuence of publication of the first international books. 

The .student te.st booklets consi.sted of text pas.sages to be read by 
the students and. afier having read the pa.s.sage, a number of (jue.stions 
— which are the items of the te.st — were then presented in either 
clo.sed multiple respon.se-calegoiy form (mc) or as open-ended c|ue.s- 
tions. The mc-(|ue.sti()ns were con.slructed usually with four aaswer 
categories, only oiie of which could be considered to represent the 
correct im.swer. An example is given in Figure 12.1. 

rhe pas.sages were, for both populations A and b, grouped in 
advaiice into three .so-called domains: 
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Figure 12.1: Text passage from population A, expository domain 



The Walrus 



The walrus is easy to recognize because it 
has two large teeth sticking out of its 
mouth. These teeth are called eye teeth 

The walrus lives in cold seas. If the 
water freezes over, the walrus keeps a 
hole free of ice either by swimming round 
and round in the water, or by hacking off 
the edge of the ice with its eye teeth. The 
walrus can also use its skull to knock a 
hole in the ice. 

The walrus depends on its eye teeth 
for many things. For example, when 
looking for food a walrus dives to the 
bottom of the sea and uses its eye teeth 
to scrape off clams. The walrus also uses 
its eye teeth to pull itself onto the ice. It 
needs its eye teeth to attack or kill a seal 
and eat it. or to defend itself if attacked by 
a polar bear. 

1 Where does the walrus live? 




The walrus may grow very big and 
very old. A full-grown male is almost 4 m 
long and weighs more than 1000 kg. It 
may reach an age of 30 years. 

The walrus sleeps on the ice or on a 
piece of rock sticking out of the water, but 
It IS also able to sleep m the water 



AL" In very cold water 

BZ In tropical countries 

CL- On the bottom of lakes 

DZ In cold forest country 

2 How long can a walrus 'ive 



A~ 2 years 

B'Z 4 years 

CZ 30 years 

D~ 100 years 



• AV/r;v///rcMe\t types: 'IVxis iluil idl a sloiy or tlic order in 
wliich tilings luippcn (Pop, A: iwcniy-two items. Pop. B: iwenly- 
niiie items), 

• Expository text l\ pes: 1 Vxls tlial deseriix* things or people or 
explain how things work or why things happened (!\)p. A: 
twenty-one items. Pop, B: twenty-six items), 

• PocumcNts lew types: I'ahles. charts, diagrams, maps (!\)p. A: 
twenty-three items. Vop. H: thirty-four items). 

The intention hehind ihe grouping ot items into domains was to be 
able to a.ssess and compare student abilities using index values or score 
values as outcomes tor each specific domain. 
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The questionnaires filled out by students, teachers and school 
principals encompassed a variety of aspects ranging from (Questions to 
students like 'Number of books in home*, Hours TV-watching' over 
instRictional strategies for the teachers like Use of materials you have 
prepared yourself, 'Ask questions to deepen understanding* to factual 
school information like *Size of school library' and classroom size'. 

Altogether 656 variables containing test results and responses to 
questions were transferred back to the national centres after the com- 
pletion of the international analyses. Counting about 200,000 participat- 
ing students all over the world located in the two populations, the total 
number of obserx ations quickly adds up to approximatel)' 100 million 
bits of infomiation. No doubt, one might say, that it should be possible 
to find at least a few significant relations since, using conventional levels 
of significance for statistical tests, around S per cent type-I erroneous 
conclusions should anyway occur. One of the major technical issues 
was, in fact, to wssi^^^ tme levels of significance, when carry ing out so 
many statistical tests as intended in the study. 



A Statistical Model for APUilysis of Jtew Responses 

Although responses to the test items were originally sampled as multi- 
ple choices and, cenainly, important studies can be undenaken to see 
why and how distractors* (i.e., the wrong answer categories) are used, 
it was an early decision to conduct the international analyses on the 
levels: correct/non correct/missing for each item. Funhermore, in com- 
plete accordance with the verv* idea behind the concept of a domain, 
it was decided to construct index values, or domain scores, for each 
domain, when measures of student performance' were ready to be 
analysed in relation to background variables, rather than analy sing single- 
item information in relation to background variables. By this, it was 
tacitly assumed that all items within a fixed domain actually belong to 
one cc^mmon. latent continuum of 'difficulties* defined by the domain. 
The basic set of observations within a domain can Ix- visualized b>* the 
following frame of reference displayed in Table 12.1. 

Let the number of correct!)' solv ed items r„ across all k items 

in a domain be the manifest measure of student ability and, likewise, 

let the total item-scores (across all n students) s, s,^ be manifest 

measures of the item difficulty in the sense, that these scores exhaust 
all information about the student abilities and the item difficulties, 
respectively. Ihis is ecjuivalent, in statistical terms, to acknowledging 
these two sets of scores as sufficient statistics for student abilities c, v 

IS)0 
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Table 12. 1: Item-responses model within a domain 





Item No. 


Scores 








1 


i 


k 


(0 




Student No. 


1 






2 

V 


1 0 1 
1 1 0 


a 


1 

0 


a,. 

a?- 

a,.. 






n 








Item totals (s,) 




a, . . 


.. .a, 


a. 


a.. 



Note: Responses a.„ = 0.1 from n individuals to k domain items with individual ability 
parameters v = 1 n and item difficulty parameters 9. i = 1. . . . . k 



= 1 n and item difficulties 9. i = 1 k. At the same time a user- 
defined statistical model for analysing such schemes of n by k correct- 
non correct obser\'ations has been requested. In fact, Rasch (Rasch, 
1971) proved, that the statements of joint sufficiency of the item scores 
and student scores are unambiguously associated with the following 
probabilistic model ( 1 ) for the single-item response a,. = 0 (non correct ) 
or a,. = 1 (correct) provided by student No. v to item No. i, with 9^ 
denoting item difficulty (or easiness) and a, denoting student ability: 

(1) p(a„ = 1) = — 

1 + e 

If the Rasch model (1) — also called the one parameter logistic 
model — fits the data in Table 12. L the items are said to lie homogeu- 
eons. One of the iiK^or problems for the initial item analyses in the 
l¥A study was two-sided: to test for item homogeneity uithin each 
country- and across [ht^ countries. In fact, only if consistent item diffi- 
culties 9, i = 1 k across all countries could be identified, would 

one, indeed, have a common reading scale acting as a fixed 'ruler' for 
measuring student ability in eveiy countr\-. It is, however one of the 
essential attributes of the Rasch model, that if the model fits data the 
re.sponses to ^'n/ v subset of homogeneous items can be used for assess- 
ing student al^ilities a, v = 1 n: therefore, if a few items fail to be 

homogeneous across all countries — typically in a way that an item 
'works' (i.e., is homogeneous) irithin some countries but has to be 
omitted internationally e.g.. because of strongly vaiying difficulty — a 
reading scale still exi.sts. This is formed, then, by a set of core items 
applicalile to all countries and, depending on the countr\\ extended by 
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Figure 12.2: Item characterisxic curves for the Rasch model (V. the two-parameter model 
(3) and the three-parameter model (4) 

A Probla^i = lla^. fixed 8,) 



extra homogeneous items. The International Reading Scale for each 
domain is. consequently, a total reference set of items which, in part is 
used from one country to another. Underlying these technical efforts is. 
of course, a wish to utilize as much of the item information as possible 
in each countr\'. 

The struggle for homogeneous items is far from being just an aca- 
demic exercise, since it is easy to demonstrate (e.g.. through computer 
simulations) that a standard t-test for equal means of two populations 
judged by student scores, can be significant exclusively from the fact, 
that the two sets of student scores are outcomes from group-specific 
inhomogeneous items and. hence cannot be properly ascribed to dif- 
ferences in levels of ability. 

The Rasch model ( 1) is a member of the class of Item Response 
Theory HPT) models, where differences l')etween the models can be 
characterized by differences in the Item Characteristic Curve (ICC), i.e.. 
tlie probability of a correct response to item No. i with difficulty 9, 
examined as a function of the student ability o,: 

(2) ICC (a,) = Prob (a,, = 1 I a,, fixed 9.) 

The Rasch model — or the one parameter model — has parallel 
ICCs for all items, the position on the X-axis being determined by the 
value of the item parameter 9,. see Figure 12.2. The two parameter model 
(5) n.ord and Novick. 1968) is characterized by intersecting ICCs — i.e.. 
slopes of the ICCs differ from item to item. Many psychometricians 
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consider items with high slope values — viz., high discrimination power 
— being an advantage, because such items 'discriminate' well between 
abilities on the X-axis, but only under the one-parameter model is the 
concept of student ability related in a simple way to the number of 
correct responses across items. 

Finally, the three-parameter model (4) includes yet another feature 
of the ICCs; when the ICC is not approaching the X-axis completely but 
smooths out on a horizontal line, (p above the X-axis, it is interpreted 
as 'the probability cannot be lower than (p due to guessing'. 

(3) p(a^, = 

In model (3) the item discrimination parametet^bx K must be 

estimated from data together with the 9 — and O parameters. 

In the three parameter model (4) the parameter (p, which measures 
the probability for obtaining a correct response by chance is introduced: 

1 T" e 

For a number of technical reasons the three-parameter model was 
abandoned as a basis for analysing the lEA data (the model is outside 
the family of exponential distributions and the interpretation of the 
parameters is ambiguous). A measure of the degree of guessing was, 
liowever constructed under the one-paranieter Rasch fuodel. 

When analysing the lEA data it became clear, that if the Rasch 
model ( 1 ) were to be the one and only theoretical frame reference 
for evaluating item homogeneity, quite a number of items would ha\ e 
been dropped from the test booklets. One Cjuite often met cause for 
misfit was the observation that the empirical ICCs across (independent) 
samples of data showed consistently biased values of the item 
discriminations, 8, i')eing significantly different from unity, which is the 
value recjuired by the Rasch model ( 1 ^. For this reason the reference 
model for the IFA item analyses became the two-parameter model (3)> 
provided that the item discriminations were not vi^v)' far from unity 



Test of Fit — Creating Three International Reading 
Scales 

Statistical tests of fit were conducted by means of conditional inference, 
where the set of ability parameters are eliminated by conditioning 
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(student scores are sufficient statistics for the abilities). Some test statis- 
tics are referring to all items siniultaneously but most of the statistics 
are referring to one item at a time using the Item Characteristic Curves 
(ICCs) or, more precisely using CICCs which are the conditional ICCs, 
the probability of a correct response f^iveri the value of the student 
score, which is dependent on complex functions of the item difficulties 

but independent of the ability parameters a,, v = 1 n. 

The test statistics used in the study for testing the Rasch model (1) 
or the two-parameter model (3) were the following: 



1. DIST — distance measure 

For each item and for each score level the observed number 
of students with correct responses to the item are compared 
with the model expected number. 

2. DIFINT 

This is the internal difference lietwecn item difficulties estimated 
from the 50 per cent lowest scoring students and the 50 per 
cent highest scoring students. DIFINT should be (stochastically) 
ecjual to zero if the model fits the data. 

3. TESIIAPE 

Tliis is the test of shape of the item characteristic curve 1CC.* 
ft is tested if the ICC is compatible with the one-parameter 
Rasch model or, alternatively, the two-parameter model with 
item discrimination different from unity. If this test is rejected 
the ICC is beyond any 'reasonable' shape for the analyses (e.g., 
decreasing probability of correct respon.se with increasing 
ability). 

4. TFKXT 

This is the test of external variable, viz.. lest of sex bias. A 
Fisher exact test (conditional on score group) compares the 
frecjuency of correct responses across sex in a 2 by 2 contin- 
gency table. A ciii-S(|uare statistic aggregated across all score 
groups is calculated. 

5. THFTOT 

This is the theta paran^eter for the total data set, i.e., the estim- 
ated item difficulty for each item; items must have consistent 
item difficulties across all countries. Although one item could 
be homogeneous irithifi a country, it may happen that the item 
shows varying difficulties r/cms\ different countries, and students 
in one country could, then, profit from such nonhomogeneity 
when comparing student abilities. In such ca.ses the item should 
be omitted from tiie International Reading Scale. 
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6. ITDISCR — item discrimination 

If the item discrimination (slope of ICC) is unity it confonns 
with the one-parameter Rasch model (1). If not, one might 
accept the item for inclusion in the international scale — pro- 
vided the TESHAPE test is not significant, the item discrimina- 
tion is not ver}' far from unity and consistent values across all 
countries is ohser\ed. The estimation of student abilities can 
then take place in accordance with the two-parameter model. 

The results of applying the test statistics N(\s, 1-6 to the item re- 
sponses withiu each of the three domains were that some items had to 
be ciropped from all further analysis because they revealed misfit using 
most of the lest statistics. Such items were not considered for inclusion 
in The InlernaticMial Reading vScale, Besides, some items had to be 
dropped hoiw specific countries, mainly due to vary ing item difficulties 
across countries. Such items could be included in the reference set of 
items for The International Reading .Scale and he used in the specific 
ccumlries. According to simple Rasch model (1 ) the use of different sets 
of items for estimation of student abilities across different countries is 
not a statistical problem, since, as said, driy subset of a homogeneous 
set of items can be used for the estimation of student abilities, Of 
course, the accuracy by which student abilities are estimated is depend- 
ent on the number of items (but ver>' little on the level of difficulties). 

By these procedures, tiiree subsets of the original sets of items 
were identified as the three reference sets of items called the Interna- 
tional Reading Scales, 

Iisfiniation of Student Ahilities 

The estimation of student abilities takes place after having estimated 
the domain-specific item parameters appn)ved by the six lest statistics. 
The maximum likelihood ecjuations for estimating the student-ability 

parameters a, v = 1 n are then the following (S) and ((>) — showed 

for the one and the two-parameter model, respectively: 

(S) r = y r=l K-1 

^ri + e«*«^ 

(()) r-Y : r-5, YS, -Min(6,) 

If a student fails to respond to an item in hotuvcu two other items, 
obviously considered (correct or non correct), one may aj^rec to 'non 
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correct' as a fair coding of such a 'missing* response. If, on the other 
hand a series of 'missings' are obser\'ed tn the end of a test session, 
with no intermediate correct or non-correct responses, one would cer- 
tainly have as a valid hypothesis, that the student has never considered 
these items — maybe tx'cause of lack of lime. The not reached' items 
can, accordingly, be identified by simply coimting backwards from 
the end of the test, imtil an item is obserx^ed, which is either correct or 
non-correct, 

vSeveral views on wh.ether *not reached' items would be properly 
scored as non-correct or 'missing' were forwarded during the discus- 
sion of the statistical analyses: a purely statistical judgment of the not 
reached' items is, howe\'er available, by asking, whether the test of fit 
of ilie model (being at the same time a test of item homogeneity) 
accepts' a scoring of the not reached' items as non-correct (zero) re- 
sponses. The .structure of item homogeneity (= accepted model) 'pre- 
dicts' the expected values of the 'not-reached' items and ordinary 
comparisons between obserxed and expected responses can then be 
undertaken. 

The data files containing all data originally collected by each coun- 
tiy and the estimated student abilities based on both 'principles' of 
evaluating the 'not reached' items were later on sent back to each 
country. All international analyses, and the first international report in 
particular were, however, based on the coding of 'not reached' as non- 
correct responses (Elley, 1992). One reason for the low rank of e.g., 
Denmark and Germany (E) in the leading league tables (see Figure 
12.3) in the report can be ascribed to this mixing up of speed and 
comprehension. 

Simple Coniparisofis of Students i}i Jhiily-two Coioithes 

I 'sing the estimation technique described above, estimated student 
abilities for approximately 1()(),()0() students in Fop. A and lOO.OOO .stu- 
dents in Fop. B were obtained — yielding three domain-specific estim- 
ates. For technical reasons the measures of abilities o, v = 1 n 

were re-scaled to mean = SOO and standard de\'iation = 100 on an 
international basis. Six tables arising from three domains by two 
populations emerge directly for each countr\', and Figure 12.3 shows, 
as an example, how the student abilities for the Narrative domain 
Pop. A were presented as ranked performances for twenty seven of the 
thirty-two participating countries (Elley, ibid.). I'sing Bonferroni 
adjusted-confidence limits for the e\'aluation of differences, the picture 
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Figure 12.3: Ranking of average student performances for twenty seven of the thirty-tv^'O countries, population A, narrative domain 



Mean 100 





Grade 


Age 


Finland 


3 


9.7 


United States 


4 


10.0 


Sweden 


3 


9.8 


New Zealand 


5 


10.0 


Italy 


4 


9.9 


France 


4 


10.1 


Norway 


3 


9.8 


Singapore 


3 


9.3 


Iceland 


3 


9.8 


Ireland 


4 


9.3 


Greece 


4 


93 


Belgium (FR) 


4 


9.8 


Switzerland 


3 


9.7 


Slovenia 


3 


9.7 


Canada (BC) 


3 


8.9 


Spain 


4 


10.0 


Hungary 


3 


9.3 


Hong Kong 


4 


10.0 


Netherlands 


3 


92 


Cyprus 


4 


9.8 


Germany.fW) 


3 


9.4 


Portugal 


4 


10.4 


Germany (E) 


3 


94 


Denmark 


3 


97 


Tnnidad/Tobago 


4 


9.6 


Indonesia 


4 


108 


Venezuela 


4 


10.7 
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leaves no doubt of significant differences between the levels of per- 
formance across the countries. 

Conclusive comparisons between the thiity-two countries by means 

of the directly estimated student abilities a,, v = 1 n were, however 

inappropriate considering, among others, the highly distinctive socio- 
economic conditions in which the various national educational systems 
are embedded. As a consequence, the crude comparisons of the abili- 
ties between countries were initially modified by accepting six different 
indicators of national de\'elopment: 

• GNF per capita: 

• public expenditure of per student on education: 

• life expectancy; 

• per cent low birth rate; 

• newspapers per 1,000; and 

• per cent adult literacy as a framework for comparing the a,.\s. 

In practice, the six indicators were combined (ec|ual weight) into o}ie 
CDI-index value (Composite Development Index) which was, then, 
through a simple linear regression of ability on CDI u.^^ ' ^o calculate 
CV)\- expected reading performances. This is done in order to see if, on 
average, a country* is performing 'less' or better* than expected by the 
value of the CHI index. Some countries, by tliis exercise, could then 
'move' from a relatixely low position to a position where a global 
judgment of the results would be 'good' or, even 'better than expected'. 



Readitig Pe}joym(nice in Relatio}i to Baek^nnoid Factois 

The backgroimd factors include a number of x ariables, collected through 
ordinaiy questionnaires, de\'ised to re\'eal systematic Ixickground in- 
formation at the level of analysis given by a student, at the level of a 
class and, e\en, at the level of a school. A first issue was, therefore, 

how one could relate /;/r//V7V///r// student performance a,, v = 1 n 

to background information defined on otlier levels of analysis. A rigorous 
solution would, e.g., require an appropriate definition of the per- 
formance of a ///// class — when analyses on the level of classes are in 
(juestion. The practical solution was, however lo relate to cnvrci^e 
student reading performances, v\henc\'er performance was related to 
background — the average calculated across the relevant 'unity*, it 
being within a class or across all students in a school. 

The statistical analyses of relationship lietween student perform- 
ance and background factors used a mixture of se\'eral strategies but 




Source Elley. 1992 
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underlying all attempts was the fact that performance measured by 
Ravsch scores a,, v = 1, . , . , n was related to background factors one by 
one. By this, one could, unfortunately, be overlooking the likelihood 
that several background factors themselves are intercorrelated. In fact, 
in order to illustrate this point, imagine that the correlation between a 
variable A and a background variable H is studied, and data on a third 
background variable C are available: 



B = 12 3 4 
A = 1 396 840 1273 11S3 

A = 2 102 198 424 381 for C = 1 and C = 2 together 

p = 0.001 significant correlation between A 

and B (Chi-square test) 
— same table sub divided according to C: 



C = 1 

B = 12 3 4 

A = 1 30 i 666 894 720 
A = 2 38 85 93 84 

p = 0.60 



C = 2 

H = 12 3 4 

92 174 379 433 
64 113 321 297 

p = 0.14 



The (marginal) correlation between A and B is seen to be signifi- 
cant (p = 0.001) while analysed for each of the two levels of the third 
variable C = 1 or C - 2, the correlation between A and B becomes 
nonsignificant (p = 0.60 and p = 0.14). This situation is described as 
\X)ndiiional independence between A and B\ i.e., the simple correla- 
tion between A and B (p = 0.001) can be 'explained' by the third 
variable C. It will be part of future statistical analyses to conduct such 
multivariate analyses; the initial statistical analyses in the first interna- 
tional report are all built upon marginal analy.ses (Elley, 1992). 

One attempt to investigate the influence of educational factors on 
the national-achievement levels was to select a series of background 
variables which were (4 pnori known to \'ar>' across the thirt\'-two 
countries. The idea was simply to isolate the values from ihe ten highest 
scoring countries and the ten knvest scoring countries from these vari- 
ables in order to look for differences. The differences between high 
and low-scoring countries were expressed in standardized tenns (rela- 
tive to a- standard deviation), since the scales behind each background 
variable differ in nature. The top of the table summarizing these analy- 
ses for Pop. A is shown in Table 12.2. 
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Table 12.2: Differences between ten highest and ten lowest-scoring countries 



Policy 


Highest 


Lowest 




Proportion 


Advantage 


indicator 


10 


10 


Diff/SD 


of SD 


for. . . 




countries 


countries 








1 Starting age of 












instruction 


6.3 yrs 


5.9 yrs 


0.40/0.65 


0.62 


Later start 


2 % Students in 










Preschool 


pre-schoo! 


68.7 


53.8 


14.9/31.36 


0.48 


enrollment 


3 Class size in sample 


25.1 


24.9 


0.2/5.72 


0.03 


No difference 


4 % Fennale teachers in 












sannple 


79.7 


76.3 


3.4/16.53 


0.20 


No difference 


5 School days per year 


178.9 


191.6 


12.7/19.08 


0.67 


Shorter year 


6 Hours instruction per 










More hours 


week 


21.7 


20.6 


1.1/3.49 


0.32 


instruction 


7 Phonic regularity of 










Less regular 


language 


2.6 


3.2 


0.6/1.21 


0.50 


language 


8 % Multigrade classes 


20.3 


24.4 


3.1/23.2 


0.18 


No difference 


9 Years teaching this 












class 


1.57 yrs 


1 .56 yrs 


0.01/0.60 


0.02 


No difference 


10 Years teacner 










More education 


education 


13.80 


12.6 


1.2/1.75 


0.69 


for teachers 


1 1 % Other language 












speaking teachers 


10.7 


9.8 


0.9/21.32 


0.04 


No difference 


12 Easy access to books 










Easy access 


in connnnunity 


74.2 


62.0 


12.2/15.2 


0.80 


to books 


13 Size of school 










Large school 


library 


3.50 


2.56 


0.94/1 .97 


0.48 


library 


14 Large classroom 










Large classroom 


libraries 


45.6 


22 8 


22.8/22.7 


1.00 


library 


15 Textbooks per student 


1.74 


1.59 


0.15/0.47 


0.32 


More textbooks 


16 Frequency borrow 










More books 


books from library 


3.25 


2.95 


0.30/0.51 


0.59 


borrowed 


17 Time on teaching 










More time on 


the language 


8 12 hrs 


7 07 hrs 


1.05/2.41 


0.44 


language teaching 


18 Frequency Stient 












reading in class 


3.43 


3 36 


0.09/0.93 


0.08 


No difference 


19 Frequency teachers 










More teacher 


read to class 


2.76 


2.25 


0.51/1.18 


0 43 


reading to class 


20 % teachers give 












frequent reading tests 


38.0 


51.4 


13.4/29.0 


0.46 


Fewer tests 


21 Speed of word 










Faster word 


recognition 


86.9 


77 4 


9.53/6.32 


1 51 


recognition 


Note: No adjustments for GDI 



Source: Elley, 1992 



Interestingly, a similar table based on the CDI-adjusted perform 
ances ean be eonstriicted and, as expected by parallel arguments for- 
warded for the AJVC table, a different picture emerged. A summarv' of 
the conclusions for Pop. A with and wilhoiil C!DI-adjiistments is shown 
in Table 12..^. 

Whenever a difference between the lop-len and the lowest ten 
counlries was evaluated for a specific background factor, a lenlatixe 
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Table 12.3: Evaluation of differences between ten highest and ten lowest-scoring 
countries 



Population A Population B 

Advantage shown for: Unadjusted Adjusted Unadjusted adjusted 









for CDI 




tor LUi 


1 . 


Smaller classes 




no 


yes 




2. 


More female teachers 




yes 


no 




3. 


Shorter school year 


yes 


yes 


yes 


yes 


4. 


More hours per week 


yes 


yes 


no 




5. 


More years teacher education 


yes 


yes 


yes 


yes 


6. 


More textbooks per student 


yes 




yes 


no 


7. 


Larger school library 


yes 


yes 


yes 


yes 


8. 


More formal tests 


no 


yes 


no 




9. 


Earlier starting age 


no 


yes 






10. 


High preschool enrollment 


yes 








11. 


More multigrade classes 










12. 


More years with same class 




no 






13. 


More books in community 


yes 








14. 


Larger classroom library 


yes 


yes 






15. 


More library books borrowed 


yes 


yes 






16. 


More time teaching language 


yes 


yes 






17. 


More silent reading in class 




yes 






18 


More teacher reading to class 




yes 






19 


Students faster word recognition 


yes 


yes 






20. 


Better pupil-teacher ratio 






yes 




21. 


Principal evaluates teachers more often 






no 




22. 


More general homework given 






yes 


yes 


23. 


More reading homework given 






no 




24. 


More resources for reading 






yes 


yes 


25. 


More individual tuition 






yes 


yes 



Note. With and without adjustments for CDI 
Source: Elley. 1992 



l-lcsl. irsin^ 0.30 as cut point, was applied for classifying the particular 
difference as important' or not important*. 

More conventional statistical approaches were used when factors 
like gender and home versus school language were analysed; in fact, 
such anaK ses were conducted as ordinar\' two sample t-tests. The re- 
sults of analysing gender differences in Vop. A are presented in Table 
12.1. 

Another .set of important background factors for understanding 
differences in achiex ement lexels is defined by size of .school libraiy*. 
nutnber of books in home', hours of daily T\' watching.' degree of 
\'oluntar\' reading' imd a variable meiLsuring how urbanized the envi- 
ronment of the school is (on a .scale with 'village* at the one end and 
city' at the other end). 

Typically, such anaty.ses give ri.se to stati.sticat judgments where 
whole 'curves' arc analy.sed. .see Tigure li.S for an example. 
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Table 12.4: Statistical evaluation of gender differences in score levels, population A 





Average score (s.e ) 




Standard 


Country 






Difference 


score 




Boys 


Girls 




difference 


Belgium/Fr 


503 (4.5) 


512 (4.5) 


9 


.12 


Canada/BC 


495 (5.4) 


506 (5.4) 


ir 


14 


Cyprus 


479 (3.2) 


484 (3.2) 


5 


.07 


Denmark 


463 (5.5) 


489 (4.9) 


26* 


.34 


Finland 


564 (4.5) 


575 (4.5) 


11* 


.14 


France 


530 (5.7) 


533 (5 6) 


3 


.04 


Germa ny/E 


490 (6.3) 


509 (6.1) 


19* 


24 


GermanyA'V 


501 (3.9) 


508 (3 8) 


7 


09 


Greece 


499 (4.4) 


510 (4.2) 


11* 


.14 


Hong Kong 


512 (3.7) 


524 (3.6) 


12* 


.15 


Hungary 


495 (3.8) 


504 (3.6) 


9 


.11 


Iceland 


508 (0 0) 


528 (0.0) 


20* 


.24 


Indonesia 


394 (3.6) 


397 (3.7) 


3 


04 


Ireland 


502 (5.2) 


517 (5 0) 


15* 


.19 


Italy 


525 (5.2) 


537 (5 1) 


12' 


15 


Netherlands 


483 (5.4) 


488 (5 2) 


5 


06 


New Zealand 


519 (4.1) 


539 (4.0) 


20* 


.25 


Norway 


517 (4 6) 


533 (4 0) 


16* 


.18 


Portugal 


474 (4.5) 


483 (4,5) 


9 


1 1 


Singapore 


510 (1.3) 


521 (1.3) 


11* 


14 


Slovenia 


491 (3.3) 


506 (3.^) 


15* 


19 


Spain 


500 (3 4S 


508 (3 3) 


8* 


10 


Sweden 


533 (4.4) 


546 (4.3) 


13* 


16 


Switzerland 


507 (4.2) 


517 (4.2) 


10* 


13 


Tnnidad/Tobago 


443 (4 3) 


460 (4.1) 


17* 


21 


United States 


543 (3 6) 


552 (3.4) 


9* 


11 


Venezuela 


379 (4 2) 


392 (3 9) 


13* 


16 



Note * = Significant difference ( 05 level) 
Source Elley. 1992 



In Figure 12. S a couple of countries are displayed having similar 
cur\e patterns; other patterns (e.g., constant curxes) could also be iden- 
tified, and a discussion was undertaken of the distinctive shape of the 
cur\e. valid for a group of countries with the specific cur\*e pattern, in 
relation to the levels of student performance. 



Messages for Teachers 

In the introduction to Hoir in the world do studoits rcad'f (l-lley. 1W2) 
a brief summaiy of the findings in the first international screening of the 
data is presented. If one would have hoped for the finding of a few. 
important ke\' factors which could account for most of the \ariance 
found in the student achievement lewis around the world, one would 
have been disappointed. It turned out different. No single variable or 
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Figure 12.5. Pattern of decreasing reading achievement with increasing datly TV' viewing, 
oopulatton A 

600 n 1 



550 



500 




— #— Gernnany (W) 



0 hrs 0 5 hrs 1 5 hrs 2 5 nrs 3 5 hrs 4 5 hrs 5 5 hrs 
Source Eiley, 1992 

small group ot variables can explain' why the siudenls perfonued well 
or not so well: future analyses will, however proceed to investigate 
more details in the national-data sets and may reveal structures which 
provide useful information to national educational planners. 

The principal content of the executive summaiy was the following: 

1. National achie\'ement levels 

The students of Finland showed the highest reading literacy 
levels at both 9 and 1 4 years of age in almost all domains, 

2. Domain profiles 

The levels of reading literacy achieved in each countrx* are 
highly correlated across all three domains, and across both 
age groups. 
^, Hconomic and social context 

For most countries, the levels of reading literacy are closely 
related to their national indices of economic development, 
health, and adult literacy. 
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4. Home language 

The students of Singapore acliiex'ed high levels of literacy in 
spite of the fact that they were instructed in a non-native 
language from the beginning of their schooling. This finding 
is unexpected and potentially important. 

5. Age of beginning instruction 

Formal instruction did not begin until age 7 in four of the ten 
highest scoring countries at each level. Apparently a late start 
is not a serious handicap in reading instruction, when judged 
at age 9. However, when achievement scores were adjusted 
for economic and social circumstances across all countries, 
an earlier start was generally found to be an advantage. 

b. Differences between high and low-scoring countries. 

Factors which consistently differentiated high-scoring and low- 
scoring countries were large school libraries, large classroom 
libraries, regular book borrowing, frequent silent reading in 
class, frequent story reading aloud by teachers, and more 
scheduled hours spent teaching the language. vSeveral coun- 
tries with low scores reported veiy little experience with for- 
mal tests, but above a threshold le\'el, this factor was not 
found to differentiate high and low-achieving countries. 

"7, Less important differentiating factors 

At the 9-year-old level, no perceptible advantage was found 
in reading literacy levels of countries which had high enrol- 
ment ratios in preschool, or generally smaller classes, or large 
numbers of inultigrade classes, or longer school years, or 
policies of keeping the teachers with the same class through 
successive grades. While many of these policies were found 
reguiady in high-scoring countries, the data suggest that their 
importance may well be only a function of relative affluence 
and community factors outside the school. 

8, Gender differences 

Girls achieved at higher levels than boys in all countries in 
population A, and in most countries in population B. 

9, Language differences 

Ghildren whose home language is different from that of the 
school show lower literacy levels in all countries at both age 
levels. 

10. I'rban-rural differences 

Trban children achieve at higher levels than rural children in 
most education systems. However, in a few highly developed 
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countries, rural students show literacy levels which are as 
good as. or even better than their cit\' age mates, 

11, Importance of books 

The availability of books is a key factor in reading literacy. 
The highest scoring countries typically prcn ide their students 
with greater access to books in the home, in nearby commun- 
ity libraries and book stores, and in the school, 

12, Links with television 

Television viewing occupies much of students out-of-.school 
discretionary' time. In a few countries large numbers of chil- 
dren watch TV for more than fi\'e hours per day. Those who 
watch T\' often tend to score at lower le\'els than who watch 
less, as a general rule. 
15, Voluntarv' reading 

The amoimt of \'oluntary out-of-school liook reading that stu- 
dents report is positively related to their achie\'ement levels. 
The relationship is clearer at the 9-year-old le\'el. and in the 
developing countries at age \ {, 



Conclusion 

The technical underpinnings of the IRA study of reading literacy* ha\'e 
been briefly outlined. Within the text are included references that will 
enable the reader to follow up the ideas presented. As in so much of 
education, for both teachers and pupils, one of the best ways of check- 
ing wliether or not a particular line of argument has been understood 
is to see whether one can explain and justify it to others. 
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Children Learning to Read: International Concerns 

Volume 2: 
Curriculum and Assessment Issues: 
Messages for Teachers 

Edited by 
Pamela Owen and Peter D Pumfrey 

This two volume series provides an international perspective on how children 
learn to read. Research studies and classroom experiences from around the 
worid are reported highlighting implications for design, implementation and 
evaluation of classroom reading programmes. Contributions are included 
from the UK. USA. Canada. Australia. Jamaica and Israel, and evidence is 
drawn from 18 countries. The juxtapositioning of evidence from different 
contexts, cultures and languages within a single volume provides a spectrum 
of experience which represents a snapshot of theory and practice in relation to 
worid literacy. 

Optimising pupils' reading attainments depends crucially on professionals' 
understanding of the conditions that facilitate reading as a thinking process. 
These two volumes address the three major common concerns of those 
involved in how children become literate and by what means such 
achievements may be appraised. They are: developing understanding of the 
nature of children's eariy reading development; considering ways in which 
children's reading can be encouraged; and the assessment of reading 
standards. 

The two volume series is distinctive by virtue of It being international in its 
orientation; interdisciplinary with contributions from professionals from 
complementary fields; addressing the interests and needs of teachers and 
mentors; and examining the Internationa! Issues of accountability. 

Pamela Owen worked as a primary teacher and as a secondary English teacher before 
moving into literacy research at the Centre for Formative Assessment Studies. School of 
Education, University of Manchester. There she worked on the development of a 
literacy assessment scheme and then moved in to the development of SAT English 
material for the STAIR consortium. Since 1991 she has been Senior Lecturer at St 
Martin's College with responsibility for the design and delivery of reading and language 
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